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CHIMERIC OP6 POLYPEPTIDES 

Field of the Invention 
5 The invention relates generally to chimeric 

polypeptides. More particularly, the invention relates 
to chimeric polypeptides comprising a fusion of an 
osteoprotegerin dimerization domain to a heterologous 
sequence. The polypeptides may be used in a variety of 
10 diagnostic and therapeutic applications. 

Background of the Invention 

Cells recognize a variety of signals which 

15 modulate growth, differentiation and metabolism. 

Effectors of cellular functions include small molecular 
weight organic compounds, carbohydrates, amino acids, 
peptides and proteins. At present, the best understood 
signalling process employs secretion of a signalling 

20 molecule from one cell to modulate functions of other 
cells (autocrine regulation) . It has also been 
observed that secreted signalling molecules may also 
modulate the functions of cells which secrete them 
(paracrine regulation) . The ability of cells to 

25 respond to external signals usually requires that the 
appropriate receptors which bind the signalling 
molecules be present on the cell surface. Protein- 
mediated signalling between cells involves binding of 
growth factors, hormones, cytokines, cell adhesion 

30 proteins and the like to cell surface receptors. 

As a class of proteins, receptors vary in 
their structure and mode of signal transduction. They 
are characterized by having an extracellular domain 
that is involved in binding a signalling molecule and 
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cytoplasmic domain which transmits an appropriate 
intracellular signal. Receptor expression patterns 
ultimately determine which cells will respond to a 
given ligand, while the structure of a given receptor 
5 dictates the cellular response induced by ligand 
binding. Receptors have been shown to transmit 
intracellular signals via their cytoplasmic domains by 
activating protein tyrosine, or protein 
serine/threonine phosphorylation (e.g., platelet 

10 derived growth factor receptor (PDGFR) or transforming 
growth factor-p receptor-I (TGFpR-I) , by stimulating 
G-protein activation (e.g., p-adrenergic receptor), and 
by modulating associations with cytoplasmic signal 
transducing proteins (e.g., TNFR-1 and Fas/APO) 

15 (Heldin, Cell 8JD, 213-223 (1995)). 

The tumor necrosis factor receptor (TNFR) 
superfamily is a group of type I transmembrane proteins 
which share a conserved cysteine-rich motif which is 
repeated three to six times in the extracellular domain 

20 (Smith, et al . Cell 7J5, 953-962 (1994)). Collectively, 
these repeat units form the ligand binding domains of 
these receptors (Chen et al . , Chemistry 270 , 2874-2878 
(1995) ) . The ligands for these receptors are a 
structurally related group of proteins homologous to 

25 TNFa. (Goeddel et al . Cold Spring Harbor Symp. Quart. 
Biol. 51, 597-609 (1986); Nagata et al . Science 267 , 
1449-1456 (1995)). TNFa binds to distinct, but closely 
related receptors, TNFR-1 and TNFR- 2 . TNFa produces a 
variety of biological responses in receptor bearing 

3 0 cells, including, proliferation, differentiation, and 
cytotoxicity and apoptosis (Beutler et al . Ann. Rev. 
Biochem. 57, 505-518 (1988)). 

TNFa is believed to mediate acute and chronic 
inflammatory responses (Beutler et al . ibid). Systemic 

3 5 delivery of TNFa induces septic shock-like syndrome and 
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widespread tissue necrosis. Because of this, TNFa may 
be responsible for the severe morbidity and mortality 
associated with a variety of infectious diseases, 
including sepsis. Mutations in FasL, the ligand for 
5 the TNFR-related receptor Fas/APO (Suda et al . Cell 
75, 1169-1178 (1993)), is associated with autoimmunity 
(Fisher et al . Cell 81, 935-946 (1995)), while 
overproduction of FasL may be implicated in drug- 
induced hepatitis. Thus, ligands to the various TNFR- 

10 related proteins often mediate the serious effects of 
many disease states, which suggests that agents that 
neutralize the activity of these ligands would have 
therapeutic value. 

Soluble TNFR-1 receptors and antibodies that 

15 bind TNFa have been tested for their ability to 

neutralize systemic TNFa (Loetscher et al. Cancer Cells 
3,, 221-226 (1991)) . A naturally occuring form of a 
secreted TNFR-1 and TNFR-2 mRNA was recently cloned, 
and its product tested for its ability to neutralize 

20 TNFa activity in vitro and in vivo (Kohno et al . Proc . 
Natl. Acad. Sci . USA 87, 8331-8335 (1990)). The 
ability of this protein to neutralize TNFa suggests 
that soluble TNF receptors function to bind and clear 
TNF thereby blocking the cytotoxic effects on TNFR- 

25 bearing cells. 

Recombinant ly-produced TNF inhibitors have 
also been taught in the art. For example, EP 393 438 
and EP 422 339 teach the amino acid and nucleic acid 
sequences of a n 3 0kDa TNF inhibitor" (also known as a 
30 p55 receptor) and a MOkDa inhibitor" (also known as 
a p75 receptor) as well as modified forms thereof, 
e.g., fragments, functional derivatives and variants. 
EP 393 438 and EP 422 339 also disclose methods for 
isolating the genes responsible for coding the 
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inhibitors, cloning the gene in suitable vectors and 
cell types, and expressing the gene to produce the 
inhibitors. Mature recombinant 30kDa TNF inhibitor 
and mature recombinant 40kDa TNF inhibitor have 
5 been demonstrated to be capable of inhibiting TNF 
(EP 393 438 , EP 422 339, PCT Publication No. WO 
92/16221 and PCT Publication No. WO 95/34326) . 

A recently identified member of the TNFR 
family, termed Osteoprotegerin (OPG) , is a secreted 

10 polypeptide which inhibits osteoclast maturation and 
markedly increases bone density in transgenic mice 
expressing the OPG polypeptide. OPG inhibited in vitro 
the formation of mature osteoclasts from hematopoietic 
progenitor cells and reduced the extent of bone loss in 

15 ovariectomized rats (see co-owned and co-pending U.S. 
Serial Nos . 08/577,788, filed December 22, 1995; 
08/706,945, filed September 3, 1996; and 08/771,777 
filed December 20, 1996) . OPG may have benefit in the 
treatment of osteopenia. PCT Application No. 

20 W096/26217 discloses a polypeptide termed 

Osteoclastogenesis Inhibitory Factor (OCIF) which is 
identical to OPG. 

OPG comprises two domains having different 
structural and functional properties. The 

25 amino-terminal domain spanning residues 22-194 in the 
mature polypeptide shows homology to other members of 
the TNFR family, especially TNFR -2 , through 
conservation of cysteine rich domains characteristic of 
TNFR family members . The carboxy terminal domain 

30 spanning residues 194-401 has no significant homology 
to any known sequences. Unlike a number of other TNFR 
family members, OPG appears to be exclusively a 
secreted protein and does not appear to be synthesized 
as a membrane associated form. Analysis of OPG by 

3 5 reducing and non-reducing gel electrophoresis indicated 
that the full-length mature polypeptide of 3 80 amino 
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acids formed a dimer having a molecular weight of about 
12 0 kDa as compared to the monomer molecular weight of 
about 6 0 kDa. OPG polypeptides having certain 
truncations in the carboxy terminal domain or 
5 substitutions of certain cysteine residues within in 
the carboxy terminal domain formed dimeric OPG to a 
lesser extent and had lower biological activity 
compared to wild- type OPG. However, replacement of 
part or all of the OPG carboxy terminal domain with an 

10 Fc region of IgG restored biological activity in the 
OPG fusion protein to near normal levels. Based upon 
these observations, the amino- terminal region of OPG 
appeared to be required for biological activity while 
the carboxy-terminal domain was important for 

15 dimerization . In addition, the biological activity of 
OPG appeared to be enhanced when the molecule was in 
dimeric form. 

In a therapeutic regimen, it is often 
desirable to modulate a biological response either by 

20 enhancing or blocking a signal received by a receptor. 
Enhancement of a biological response can involve 
increasing the affinity of the signalling molecule for 
a receptor, or increasing the half -life of the molecule 
in circulation such that it is bound to the receptor 

25 for a longer period of time. When the signalling 

molecule is a polypeptide, enhancement of a biological 
response may be achieved by constructing analogs which 
have amino acid sequence changes that increase binding 
or half-life, derivatives (e.g., polypeptides modified 

3 0 with water soluble polymers) to increase solubility 
and/or half-life, or chimeric polypeptides (e.g, 
polypeptides fused to the Fc region of IgG) which 
increase half-life, solublility and/or modify the 
aggregation state of the protein in circulation. 

3 5 Similar approaches may be taken to develop therapeutic 
proteins which act as antagonists by blocking a 
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biological response. In particular, soluble forms of 
transmembrane receptors which may encompass part or all 
of the extracellular domains have been used to prevent 
ligand binding and receptor activation. Soluble 
5 receptors have been developed as chemically-modified 
derivatives and as chimeric polypeptides. 

Due to the relatively low inhibition of 
cytotoxicity exhibited by the 3 0kDa TNF inhibitor and 
40kDa TNF inhibitor (Butler et al . Cytokine 6, 616-623 

10 (1994) ) , various groups have generated dimers of TNF 
inhibitor proteins (Butler et al . (1994), supra ; and 
Martin et al . Exp. Neurol. 131 , 221-228 (1995)). 
However, the dimers may generate an antibody response 
(Martin et al . (1995), supra ; and Fisher et al . New 

15 Eng. J. Med., 334 , 1697-1702 (1996)). 

Generation of chimeric polypeptides has been 
described in the art. For example, construction of 
hydrid immunoglobulin molecules by fusion of a ligand 
binding partner to a human IgG chain is described in 

20 U.S. Patent Nos . 5,116,964 and 5, 428,130. 

Construction of a chimeric polypeptide comprising the 
extracellular domain of a TNF receptor fused to a mouse 
IgG heavy chain is described in U.S. Patent No. 
5,447,851. Chimeric polypeptides comprising the 

2 5 extracellular domain of a human PDGF receptor fused to 

dimerizing proteins is described in EP 0 721 983. 
Multimers of soluble forms of TNF receptors are 
described in U.S. Patent No. 5,478,92 5. 

While fusion proteins, such as those 

3 0 comprising immunoglobulin constant regions, may have 

desirable biological properties, they can elicit an 
immune response which limits their usefulness as a 
human therapeutic . 

Therefore, it is an object of the invention 
3 5 to provide chimeric polypeptides which enhance or block 
a biological response. Such polypeptides may have 
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increased stability, solubility, circulating half-life 
and decreased immunogenicity . 

It is another object of the invention to 
provide chimeric polypeptides which combine the active 
5 region of a signalling molecule with an OPG 

dimerization domain wherein said chimeric polypeptides 
will enhance or block a biological response 
characteristic of the signalling molecule portion of 
the chimera 

10 It is another object of the invention to 

provide OPG chimeric polypeptides which form dimers, 
trimers and higher multimers which may have 
advantageous properties such as increased binding 
affinity, greater stability, and longer circulating 

15 half -life compared to monomeric forms. 

Summary of the Invention 

The invention provides for chimeric 
polypeptides comprising fusions of an OPG dimerization 
20 domain to a heterologous sequence. Also provided for 
are nucleic acid sequences encoding the polypeptides, 
expression vectors and host cells for production of the 
polypeptides, and pharmaceutical compositions 
comprising the polypeptides. 

2 5 a het erologous sequence of the invention 

comprises an amino acid sequence of a cell signalling 
molecule, such as a receptor, an extracellular domain 
thereof, and an active fragment, derivative and analog 
of a receptor or an extraceullular domain. In a 

3 0 preferred embodiment, heterologous sequences are 

selected from the family of TNF-like receptors. Such 
sequences preferentially include functional 
extracellular ligand binding domains and lack 
functional transmembrane and cytoplasmic domains. In 
3 5 another embodiment, the transmembrane and cytoplasmic 
domains are deleted in whole or in part. It is 
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understood that heterologous sequences of the invention 
do not include the amino terminal region of OPG defined 
by residues 22-194 as shown in U.S. Serial No. 
08/577,788 filed December 22, 1995 and hereby 
5 incorporated by reference, and do not include related 
amino acid sequences which, when fused to an OPG 
dimerization domain, exhibit the biological activity of 
OPG. 

Also encompassed by the invention are 
10 multimeric polypeptides comprising covalently 

associated monomers of OPG chimeric polypeptides. The 
monomers may have identical heterolgous sequences or 
different heterologous sequences. In a preferred 
embodiment, the multimeric polypeptide is a dimer, 
15 either a heterodimer (different heterologous sequences) 
or a homodimer (identical heterologous sequences). 

The chimeric polypeptides of the invention 
are produced by transforming or transfecting host cells 
with nucleic acids encoding the polypeptide, culturing 
20 the host cells, and recovering the polypeptide from the 
culture. Also provided for are expression vectors and 
host cells for producing the chimeric polypeptides. 

The chimeras are useful for detecting 
molecules which interact with fused heterologous 

2 5 sequences and thereby identifying potential new 

receptors and ligands . The compositions of chimeric 
polypeptides provided herein are useful for treatment 
of a variety of disorders, for example those related to 
receptor binding. In one embodiment, compositions 

3 0 comprising TNF/OPG and TNFR/OPG chimeric are used to 

treat TNF and TNFR mediated disorders, such as 
inflammation, autoimmune diseases, and disorders 
related to excessive apoptosis 
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Description of the Figures 

Figure 1*. Amino acid sequences of human, 
mouse and rat OPG dimerization domains (residues 194- 
5 401 of corresponding full-length OPG polypeptides) . 
Conserved cysteine residues implicated in disulfide 
bond formation are underlined. 



Figure 2 . Nucleic acid and amino acid 
10 sequence of mature, full-length 30 kDa TNF inhibitor. 

Figure 3 . Nucleic acid and amino acid 
sequence of mature, full-length 40 kDa TNF inhibitor, 

15 Figure 4. Amino acid sequences of TNFbp/OPG 

chimeric polypeptides. The TNFbp portion of the 
chimera is the full-length 3 0 kDa TNF inhibitor with 
the leader sequence (underlined) and the additional 
sequence VKGTEDSGTT at the carboxy terminus . OPG 

20 dimerization domains are human OPG residues 194-401, 

196-401, 217-401, 248-401 and 304-401. The junction of 
the TNFbp and OPG sequences creates an Age I 
restriction site in the DNA sequence and adds a glycine 
codon (at position 212) . 

25 

Figure 5. Gel electrophoresis analysis of 
TNFbp/OPG chimeric polypeptides. TNFbp/OPG chimeic 
plasmids were transfected into CHO d-cells . 
supernatants from serum- free roller bottle harvests 
3 0 were analyzed on a 12% polyacrylamide , Tris-glycine, 

non-reducing gel. Dimerization patterns were compared 
to a TNFbp -Fc fusion (lane 1) and TNFbp monomer (lane 
8) . 
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Figure 6. Inhibition of TNFOt cytotoxicity on 
L929 cells. Serum-free conditioned medium samples of 
TNFbp/Fc and TNFbp/OPG [194-401] fusion polypeptides 
were serially diluted and assayed for inhibition of 
5 TNFoc cytotoxicity on L929 cells. 

Detailed Description of the Invention 

The invention provides for a chimeric 
polypeptide comprising a fusion of an OPG dimerization 

10 domain to a heterologous sequence. 

The term "heterologous sequence" refers to an 
amino acid sequence which is involved in cell 
signalling and acts to modulate cell growth, 
differentiation or metabolism. In general, 

15 heterologous sequences comprise extracellular ligand 
binding domains of cell surface receptors and their 
cognate ligands. When present as part of an OPG 
chimeric polypeptide, a heterologous sequence of the 
invention comprises about ten or more amino acids in 

2 0 length, about 20 or more amino acids in length, about 

50 or more amino acids in length, and about 100 or more 
amino acids in length. A heterologous sequence will be 
of sufficient size to confer on a chimeric polypeptide 
a functional property such as receptor binding, 

2 5 enzymatic activity, inhibitor activity and the like; 

however, it is understood that the chimeric 
polypeptides will not have functional properties 
identical to OPG although they may share one or more 
functions in common with OPG. Heterologous sequences 

3 0 may encode full-length polypeptides or active 

fragments, derivatives and analogs thereof. 

In preferred embodiments, chimeric OPG 
polypeptides include heterologous sequences encoding 
growth factors, cytokines, hormones, cell adhesion 
3 5 molecules and other polypeptide factors which are 
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typically secreted. Chimeric OPG polypeptides also 
include heterologous sequences which encode receptors 
for growth factors, cytokines, hormones, cell adhesion 
molecules, and the like, and preferably will include 
5 extracellular ligand binding domains from said 
receptors, and active fragments, derivatives and 
analogs thereof. The heterologous sequences may or may 
not be capable of forming dimers or higher aggregates 
when the sequences are present in a naturally occurring 
10 form. 

The "OPG dimerization domain" refers to that 
portion of the OPG polypeptide which is capable of 
forming covalently associated multimeric polypeptides. 
It is understood, however, that chimeric polypeptides 

15 comprising an OPG dimerization domain are not 

restricted to forming dimers, but may form higher 
multimers as well (trimers, tetramers, etc.) The 
domain may have the amino acid sequence of the human 
osteoprotegerein dimerization domain, or it may be a 

2 0 fragment, derivative or analog thereof which is capable 
of forming covalently associated multimers. More 
specifically, an OPG dimerization domain will retain 
one or more cysteine residues which will allow 
formation of at least one interchain disulfide bond. 

2 5 In a preferred embodiment, the OPG dimerization domain 

has the amino acid sequence from about residues 194 to 
401 inclusive of human OPG. 

As used herein, the term "fragment" comprises 
a deletion of one or more amino acids in a heterologous 

3 0 sequence or in an OPG dimerization domain. The 

deletion may occur at the amino terminal end, the 
carboxy terminal end or in an internal region of the 
sequence. As used herein, the term "derivative" refers 
to a modification of the polypeptide backbone of an OPG 
3 5 chimera, either within the OPG dimerization domain or 
within the heterologous sequence. Said modif icaitons 
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include, but are not limited to, attachment of water 
soluble polymers, hydrophobic moieties, fluorescent 
tags, enzymatic labels and the like. As used herein, 
the term "analogs" refers to one or more amino acid 
5 substitutions and/or insertions within a polypeptide. 
Substitutions may involve conservative replacements or 
non-conservative replacements of amino acids which are 
known to one skilled in the art. Amino acid insertions 
may occur at the amino or carboxy terminal ends of 
10 either the OPG dimerization domain or the heterolgous 
sequence or both, or may occur in internal regions. 

Polypeptides * - 

Chimeric polypeptides of the invention 

15 comprise a heterologous sequence fused at its carboxy 
terminus to the amino terminus an OPG dimerization 
domain or, alternatively, an OPG dimerization domain 
fused at its carboxy terminus to the amino terminus of 
a heterologous sequence. Chimeric polypeptides may be 

20 constructed as a direct fusion of a heterologous 
sequence and an OPG dimerization domain or may be 
constructed with a spacer or adapter region having one 
or more amino acids inserted between the two portions 
of the polypeptide. Optionally, the spacer region may 

2 5 encode a protease cleavage site. The precise site of 

the fusion is not critical and may be varied by one 
skilled in the art in order to optimize binding 
charcteristics and/or biological activity of the 
heterologous sequence . 

3 0 According to the invention, an OPG 

dimerization domain may be mammalian in origin (such as 
from mouse, rat or human) or may be a fragment or 
analog thereof which is capable of forming covalently 
associated dimers or higher order multimers. The amino 
3 5 acid sequences of rat, mouse and human OPG dimerization 
domains span from about residues 194-401 of their 



* • 
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respective full-length OPG polypeptides as shown in 

Figure 1 (SEQ ID NO: ) . Fragments and analogs of an 

OPG dimerization domain include: deletion or 
substitution of a cysteine residue at any of positions 
5 195, 202, 277, 319 and 400; addition of one or more 
cysteine residues; rearrangement of the configuration 
of cysteine residues which may entail a net increase 
from, a net decrease from, or no change in the number 
of cysteine residues compared to residues 194-401 of 

10 the human OPG dimerization domain; amino-terminal 

truncations of OPG [194-401] , e.g, 195-401, 196-401, and 
so forth; C-terminal truncations of OPG [194-401] , e.g, 
194-400, 194-399, and so forth; conservative 
substitutions of amino acid residues in OPG [194-401] 

15 wherein the substitutions comprise replacements with 

structurally or functionally similar amino acids which 
are known to one skilled in the art; and any 
combinations thereof . 

Heterologous sequences which form part of a 

2 0 chimeric OPG polypeptide include receptors having known 
extracellular ligand binding domains. Examples are 
receptor protein- tyrosine kinases, such as the 
platelet-derived growth factor receptor (PDGFR) family, 
fibroblast growth factor receptor (FGFR) family, 

25 insulin receptor family, epidermal growth factor 

receptor (EGFR) family, nerve growth factor { NGFR ) 
family, hepatocyte growth factor family (HGFR) , EPH 
family, AXL family, TIE family, DDR family, ROR family, 
and other receptor protein tyrosine kinases (see van 

30 der Geer et al . Ann. Rev. Cell Biol. 10, 251-337 
(1994)). Other examples of receptors having 
extracellular ligand binding domains include the 
cytokine receptor superfamily, such as G-CSF, GM-CSF (a 
andp subunits) , MGF, EPO, MGDF, IL-1, IL-2, IL-3, IL- 

35 4, IL-5, IL-6, IL-7, IL-9, IL-11, growth hormone, a- 
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interferon, p-interf eron, and y-interferon receptors, 
the seven transmembrane domain receptor superfamily, 
such as acetylcholine, adrenergic, dopamine, thrombin, 
FSH, gonadotropin, thyrotropoin, clacitonin and 
5 parathyroid hormone receptors, and cell adhesion 

receptors. It is understood that the receptors cited 
herein are merely examples and that heterologous 
sequences present in OPG chimeric polypeptides are not 
limited to the above-mentioned receptors. 

10 Other heterologous sequences of the invention 

comprise growth factors, hormones, cytokines, cell 
adhesion proteins and the like. Also included are 
corresponding ligands for the receptor protein tyrosine 
kinases, ligands for cytokine receptors, ligands for 

15 seven transmembrane domain receptors, and ligands for 
cell adhesion receptors. 

In a preferred embodiment, the heterologous 
sequence is a member of the TNF receptor superfamily or 
is derived from a member of the TNF receptor family. 

2 0 Members include TNFR-1, TNFR-2, TNFrp, NGFR, FasB, 

CD40, 0X4 0, CD27, CD30, and 4-lBB. Typically the 
extracellular domains of TNF receptors, or active 
fragments, derivatives and analogs thereof, are fused 
to an OPG dimerization domain. Active fragments of TNF 
25 receptors will have at least one cysteine rich domain, 
alternatively two, three or four cysteine rich domains, 
or alternatively one, two or three cysteine rich 
domains and a portion thereof, for example, two 
cysteine rich domains and a portion of a third domain. 

3 0 Activity of a TNF /OPG chimeric polypeptide may include 

biological activity or ligand binding activity 
characteristic of a TNF family member which may be 
evaluated using procedures known to one skilled in the 
art . 

3 5 Preferred heterologous sequences 

comprise TNFR-1 or are derived from TNFR-1, and may be 
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a 30kDa TNF inhibitor, a 40 kDa TNF inhibitor, or a 
functionally active low molecular weight TNF inhibitor. 
The nucleic acid and amino acid sequence of mature, 
full-length 30kDa TNF inhibitor is shown in Figure 2 

5 (SEQ ID NO: ) . The nucleic acid and amino acid 

sequence of mature, full-length 40kDa TNF inhibitor is 

shown in Figure 3 (SEQ ID NO: ) . The low molecular 

weight TNF inhibitors are modified forms of the 30kDa 
TNF inhibitor and 40 kDa TNF inhibitor which do not 
10 contain the fourth domain (amino acid residues Thr 127 - 
Thr 161 of the 3 0kDa TNF inhibitor and amino acid 

residues Pro 141 -Thr 179 of the 40kDa TNF inhibitor) ; a 
portion of the third domain (amino acid residues 
Asn ul -Cys 126 of the 30kDa TNF inhibitor and amino acid 

15 residues Pro 123 -Lys 140 of the 40kDa TNF inhibitor) ; and, 
optionally, which do not contain a portion of the first 
domain (amino acid residues Asp 1 -Lys 21 of the 30kDa TNF 
inhibitor and amino acid residues Leu 1 -Lys 34 of the 
40kDa TNF inhibitor) . 

20 The heterologous sequences of the present 

invention include derivatives of TNFR-1 proteins 
represented by the formula Ri- [Cys 19 -Cys 103 ] -R 2 and 
R4- [Cys 32 -Cys 112 ] -R5 . These proteins are deletion 
variants of the 3 0kDa TNF inhibitor and the 40kDa TNF 

25 inhibitor, respectively, and are referred to as 
" truncated TNFbp ( s ) " . 

By "Ri- [Cys 19 -Cys 103 ] -R2" is meant one or more 

proteins wherein [Cys 19 -Cys 103 ] represents residues 19 
through 103 of mature, full-length 30kDa TNF inhibitor, 
3 0 the amino acid residue numbering scheme of which is 

provided in Figure 2 (SEQ ID NO: ) to facilitate the 

comparison; wherein Ri represents a methionylated or 

nonmethionylated amine group of Cys 19 or of amino- 
terminus amino acid residue (s) selected from the group: 



it 
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C 
IC 
SIC 



NSIC 


(SEQ 


ID 


NO 


NNSIC 


(SEQ 


ID 


NO 


QNNSIC 


(SEQ 


ID 


NO 


PQNNSIC 


(SEQ 


ID 


NO 


HPQNNSIC 


(SEQ 


ID 


NO 


IHPQNNSIC 


(SEQ 


ID 


NO 


Y I HPQNNSIC 


(SEQ 


ID 


NO 


KYI HPQNNSIC 


(SEQ 


ID 


NO 


GKY I HPQNNS I C 


(SEQ 


ID 


NO 


QGKY I H PQNNS I C 


(SEQ 


ID 


NO 


PQGKY I H PQNNS I C 


(SEQ 


ID 


NO 


C PQGKY IH PQNNS I C 


(SEQ 


ID 


NO* 


VCPQGKYIHPQNNSIC 


(SEQ 


ID 


NO 


S VC PQGKYIHPQNNS I C 


(SEQ 


ID 


NO 


DS VC PQGKY I HPQNNS I C 


(SEQ 


ID 


NO 



and wherein R2 represents a carboxy group of Cys 103 or 

of carboxy- terminal amino acid residues selected from 
5 the group: 



F 
FC 
FCC 
FCCS 
FCCSL 
FCCSLC 
FCCSLCL 



(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 



) ; 



and variants thereof . 

Exemplary tumor necrosis factor binding 
10 proteins which comprise TNFbp/OPG chimeric polypeptides 
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of the present invention include the following 
molecules: NH2 -MDSVCPQGKYIHPQNNSIC- [Cys 19 -Cys 103 ] - 
FC-COOH (also referred to as 30kDa TNFbp 2.6C105); 
NH 2 -MDS VC PQGKY IHPQNNS I C - [Cys 19 -Cys 103 ] -FNCSL-COOH 
5 (also referred to as 30kDa TNFbp 2.6C106); 

NH 2 -MDSVCPQGKYIHPQNNSIC- [Cys 19 -Cys 103 ] -FNCSL-COOH (also 
referred to as 30kDa TNFbp 2.6N105); NH2- 

MYIHPQNNSIC- [Cys 19 -Cys 103 ] -FNCSL-COOH (also referred to 
as 30kDa TNFbp 2.3d8); NH 2 -M- [Cys 19 -Cys 103 ] -FNCSL-COOH 

10 (also referred to as 30kDa TNFbp 2.3dl8); and 

NH2-MSIS- [Cys 19 -Cys 103 ] -FNCSL-COOH (also referred to as 
30kDa TNFbp 2.3dl5), either methionylated or 
nonmethionylated, and variants and derivatives thereof. 

By "R4- [Cys 32 -Cys 112 ] -Rs M is meant one or more 

15 proteins wherein [Cys 32 -Cys 112 ] represents residues 

Cys 32 through Cys 112 of mature, full-length 40kDa TNF 
inhibitor, the amino acid residue numbering scheme of 

which is provided in Figure 3 (SEQ ID NO: ) to 

facilitate the comparison; wherein R4 represents a 

2 0 methionylated or nonmethionylated amine group of Cys 32 

or of amino- terminus amino acid residue (s) selected 
from the group: 

C 
MC 
QMC 

AQMC (SEQ ID NO 

TAQMC (SEQ ID NO 

QTAQMC (SEQ ID NO 

DQTAQMC (SEQ ID NO 

YDQTAQMC (SEQ ID NO 

YYDQTAQMC (SEQ ID NO 

EY YDQTAQMC (SEQ ID NO 

REYYDQTAQMC (SEQ ID NO 

LREYYDQTAQMC (SEQ ID NO 
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RLREYYDQTAQMC 


(SEQ 


ID 


NO 


CRLREYYDQTAQMC 


(SEQ 


ID 


NO 


* TCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


STCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


GSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


PGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


EPGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


PEPGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


APEPGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


YAPEPGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


PYAPEPGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


TPYAPEPGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


F T P Y A P E PG S TC RLREYYDQTAQMC 


(SEQ 


ID 


NO 


AFTPYAPEPGSTC RLREYYDQTAQMC 


(SEQ 


ID 


NO 


VAFTP YAPE PGS TC RLREYYDQTAQMC 


(SEQ 


ID 


NO 


QVAFTPYAPEPGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


AQVAFTPYAPEPGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


PAQVAFTPY APE PGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 


LPAQVAFTPYAPEPGSTCRLREYYDQTAQMC 


(SEQ 


ID 


NO 



and wherein R5 represents a carboxy group of Cys 112 or 

of carboxy-terminal amino acid residues selected from 
the group : 



R 
RL 
RLC 
RLCA 
RLCAP 
RLCAPL 
RLCAPLR 
RLCAPLRK 
RLCAPLRKC 
RLCAPLRKCR 




and variants thereof 
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As shown in Example 1, a hybrid DNA molecule 
encoding TNFbp 4*0, the full-length 30 kDa TNF 
inhibitor (Figure 2) with the additional sequence 
VKGTEDSGTT extending from the carboxy terminus, and 
5 human OPG [194-401] was constructed. The resulting 

chimeric polypeptide, termed TNFbp/OPG [194-401] has the 
amino acid sequence as shown in Figure 4. Upon 
expression, the mature chimeric polypeptides formed 
dimers in conditioned medium of transfected host cells 

10 as determined by non-reducing SDS-PAGE (see Figure 5) . 
Additional TNFbp fusions were constructed to amino 
terminal truncations of the human OPG dimerization 
domain. These constructs are designated TNFbp/OPG [ 196- 
401], TNFbp/OPG [217-401] , TNFbp/OPG [248-401] , and 

15 TNFbp/OPG [304-401] and the amino acid sequences are 
shown in Figure 4. OPG [194-401] has the full 
complement of five cysteine residues which are involved 
in covalent association of OPG dimerization domains. 
OPG [196-401] lacks one cysteine residue at position 

20 195, OPG[217-401] and OPG [248-401] lacks a second 
cysteine residue at position 202, and OPG[304-401] 
lacks a third cysteine residue at position 277 (see 
Figure 1 for location of cysteine residues) . The 
chimeric polypeptides produced in conditioned medium of 

25 transfected CHOd- host cells were analyzed by 
non-reducing SDS-PAGE (Figure 5) . In the L929 
cytotoxicity assay, the TNFbp/OPG [194-401] chimera 
showed activity similar to a TNFbp / Fc chimera (Figure 
6) . 

30 The invention also provides for chimeric OPG 

polypeptides which form multimers (i.e., dimers, 
trimers and higher multimers) . Multimers of the 
invention comprise covalently associated monomeric OPG 
chimeras wherein the monomers may have identical 

35 heterologous sequence or different heterologous 

sequences. Preferably, the chimeric polypeptides are 
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dimers or trimers . Preparations of multimeric 
polypeptides will be essentially free of monomeric OPG 
chimeras which are not covalently associated and of 
inactive multimers. Such preparations are made using 
5 techniques available to one skilled in the art 

Modifications of chimeric OPG polypeptides 
are encompassed by the invention and include 
post-translational modifications (e.g., N-linked or 
O-linked carbohydrate chains, processing of N-terminal 

10 or C-terminal ends), attachment of chemical moieties to 
the amino acid backbone, chemical modifications of 
N-linked or O-linked carbohydrate chains, and addition 
of an N-terminal methionine residue as a result of 
procaryotic host cell expression. The polypeptides may 

15 also be modified with a detectable label, such as an 
enzymatic, fluorescent, isotopic or affinity label to 
allow for detection and isolation of the protein. 

Also provided by the invention are chemically 
modified derivatives of OPG which may provide 

2 0 additional advantages such as increased solubility, 

stability and circulating time of the polypeptide, or 
decreased immunogenic ity (see U.S. Patent No. 
4,179,337). The chemical moieties for derivitization 
may be selected from water soluble polymers such as 
25 polyethylene glycol, ethylene glycol /propylene glycol 
copolymers, carboxymethylcellulose , dextran, polyvinyl 
alcohol and the like. The polypeptides may be modified 
at random positions within the molecule, or at 
predetermined positions within the molecule and may 

3 0 include one, two, three or more attached chemical 

moieties . 

The polymer may be of any molecular weight, 
and may be branched or unbranched. For polyethylene 
glycol, the preferred molecular weight is between about 
3 5 lkDa and about lOOkDa (the term "about 11 indicating that 
in preparations of polyethylene glycol, some molecules 
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will weigh more, some less, than the stated molecular 
weight) for ease in handling and manufacturing. Other 
sizes may be used, depending on the desired therapeutic 
profile (e.g., the duration of sustained release 
5 desired, the effects, if any on biological activity, 
the ease in handling, the degree or lack of 
antigenicity and other known effects of the 
polyethylene glycol to a therapeutic protein or 
analog) . 

10 The polyethylene glycol molecules (or other 

chemical moieties) should be attached to the protein 
with consideration of effects on functional or 
antigenic domains of the protein. There are a number 
of attachment methods available to those skilled in the 

15 art, e.g. EP 0 401 384 herein incorporated by reference 
(coupling PEG to G-CSF) , see also Malik et al . Exp. 
Hematol. 20, 1028-1035 (1992) (reporting pegylation of 
GM-CSF using tresyl chloride) . For example, 
polyethylene glycol may be covalently bound through 

20 amino acid residues via a reactive group, such as, a 
free amino or carboxyl group. Reactive groups are 
those to which an activated polyethylene glycol 
molecule may be bound. The amino acid residues having 
a free amino group may include lysine residues and the 

25 N- terminal amino acid residues; those having a free 
carboxyl group may include aspartic acid residues 
glutamic acid residues and the C-terminal amino acid 
residue. Sulfhydrl groups may also be used as a 
reactive group for attaching the polyethylene glycol 

30 molecule(s). Preferred for therapeutic purposes is 

attachment at an amino group, such as attachment at the 
N- terminus or lysine group. 

One may specifically desire N-terminally 
chemically modified protein. Using polyethylene 

3 5 glycol as an illustration of the present compositions, 
one may select from a variety of polyethylene glycol 
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molecules (by molecular weight, branching, etc.)/ the 
proportion of polyethylene glycol molecules to protein 
(or peptide) molecules in the reaction mix, the type of 
pegylation reaction to be performed, and the method of 
5 obtaining the selected N-terminally pegylated protein. 
The method of obtaining the N-terminally pegylated 
preparation (i.e., separating this moiety from other 
monopegylated moieties if necessary) may be by 
purification of the N-terminally pegylated material 

10 from a population of pegylated protein molecules. 

Selective N-terminal chemically modification may be 
accomplished by reductive alkylation which exploits 
differential reactivity of different types of primary 
amino groups (lysine versus the N-terminal) available 

15 for derivatization in a particular protein. Under the 
appropriate reaction conditions, substantially 
selective derivatization of the protein at the 
N- terminus with a carbonyl group containing polymer is 
achieved. 

2 0 The chimeric OPG polypeptides of the 

invention are isolated and purified from other 
constituents present in lysates or supernatant s of host 
cells expressing the polypeptides. In one embodiment, 
the polypeptide is free from association with other 

2 5 human proteins, such as the expression product of a 

bacterial host cell. Also provided by the invention is 
a method for the purification of OPG chimeric 
polypeptides. The purification process may employ one 
or more standard protein purification steps in an 

30 appropriate order to obtain purified protein. The 
chromatography steps can include ion exchange, gel 
filtration, hydrophobic interaction, reverse phase, 
chromatof ocusing, affinity chromatography employing an 
anti-OPG antibody or biotin-streptavidin affinity 

35 complex and the like. When preparations of selected 
multimeric OPG chimeras are desired, the purification 
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method may be carried out to separate species of 
different aggregation states, for example, separation 
of monomeric from dimeric OPG chimeras, or separation 
of dimeric from tetrameric OPG chimeras. 
5 Chimeric OPG polypeptides may be used in 

assays to screen for binding molecules. Examples of 
such molecules include, but are not limited to, nucleic 
acids, polypeptides, small molecular weight peptides, 
carbohydrates, lipids and small molecular weight 

10 organic compounds. Assays will employ combining 

candidate molecules (either purified or unpurified) 
with chimeric OPG polypeptides under conditions that 
allowing binding; and measuring the extent of binding 
to the chimeric polypeptide. Binding measurements are 

15 made using detection systems available to one skilled 
in the art, such as radioactivity, enzymatic activity, 
fluorescence, and surface plasmon resonance. 

Nucleic Acids 

2 0 The invention provides for an isolated 

nucleic acid encoding a chimeric polypeptide having an 
OPG dimerization domain fused to a heterologous 
sequence. The nucleic acids encode a chimeric OPG 
polypeptide wherein the heterologous sequence is a cell 
25 signalling molecule such as a receptor or a receptor 
ligand. In a preferred embodiment, the heterologous 
nucleic acid sequence encodes a polypeptide of the TNFR 
family, or a fragment, derivative or analog thereof, 
provided however that the heterologous nucleic acid 

3 0 sequence does not encode OPG [22-194] as shown in U.S. 

Serial No. 08/577,788 filed December 22, 1995, or a 
homologous sequence which, when fused to an OPG 
dimerization domain, has the biological activity of 
OPG. 

3 5 The nucleic acids of the invention encode 

chimeric OPG polypeptides selected from the following: 
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a) the nucleic acid sequences which encode the 

polypeptides shown in Figure 1 (SEQ ID NO: ) or 

complementary strands thereof; and 

b) the nucleic acids sequences which hybridize 

5 under high stringency conditions with the sequences in 
(a), and degenerate sequences thereof, 

provided however that the polypeptides do not have the 
biological activity of OPG . Nucleic acids encoding OPG 
chimeric polypeptides may hybridize over part or all of 
10 the nucleic acid sequences encoding the OPG 

dimerization domains shown in Figure 1 (SEQ ID NO: 

) . 

The conditions for hybridization are 
generally of high stringency using temperatures, 

15 solvents and salt concentrations wherein the 

hydridizing sequences are about 12-2 0°C below the 
melting temperature (T m ) of the perfectly matched 
duplex. Equivalent stringency to these conditions may 
be readily ascertained by one skilled in the art by 

20 adjusting salt and organic solvent concentrations and 
temperature. Specific hybridization conditions are 
described in Sambrook et al . Molecular Cloning: A 
Laboratory Manual . 2nd ed. Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989) 

25 Preferred sequences include nucleic acids 

which encode chimeric OPG polypeptides having rat, 
mouse and human OPG dimerization domains. DNA encoding 
human OPG dimerization domain was provided in a full- 
length human OPG plasmid designated pRcCMV - human OPG 

30 and deposited with the American Type Culture 

Collection, Rockville, MD on December 27, 1995 under 
accession no. 69969. DNA encoding rat OPG dimerization 
domain was provided in a full-length rat OPG plasmid 
designated pMOB-Bl . 1 and deposited with the American 

35 Type Culture Collection, Rockville, MD on December 27, 
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1995 under ATCC accession no. 69970. DNA encoding 
mouse OPG dimerization domain was provided in a full- 
length mouse OPG plasmid designated pRcCMV -murine OPG 
and deposited with the American Type Culture 
5 Collection, Rockville, MD on December 27, 1995 under 
accession no. 69971. The nucleic acids of the 
invention will hybridize under stringent conditions to 
the DNA inserts of ATCC accession nos . 69969, 69970, 
and 69971. 

10 In a preferred embodiment, heterologous 

sequences will comprise nucleic acids encoding TNFR-1, 
and fragments, derivatives and analogs thereof, such as 
the TNF 30kDa inhibitor or TNF 40kDa inhibitor. 
Presently preferred heterologous sequences include 

15 those nucleic acids encoding 30kDa TNFbp 2.6C105, 3 0kDa 
TNFbp 2.6C106, 30kDa TNFbp 2.6N105, 30kDa TNFbp 2.3d8, 
3 0kDa TNFbp 2 . 3dl8 and 3 0kDa TNFbp 2.3dl5. 

Also provided by the invention are nucleic 
acids encoding variants of an OPG chimeric polypeptide 

20 wherein the variations may be in the heterologous 

sequence or the OPG dimerization domain or both. The 
nucleic acid derivatives comprise addition, 
substitution, insertion or deletion of one or more 
nucleotides such that the resulting sequences encode 

25 chimeric OPG polypeptides comprising one or more amino 
acid residues which have been added, deleted, inserted 
or substituted in either the heterologous sequence or 
the OPG dimerization domain or both. The nucleic acid 
derivatives may be naturally occurring, such as by 

3 0 splice variation or polymorphism, or may be constructed 
using site-directed mutagenesis techniques available to 
the skilled worker. Chimeric OPG polypeptide variants 
are described in the previous section entitled 
"Polypeptides" and it is anticipated that nucleic acids 

35 encoding all variants disclosed therein, and degenerate 
molecules thereof, are encompassed by the invention. 
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Examples of the nucleic acids of the 
invention include cDNA, genomic DNA, synthetic DNA and 
RNA. cDNA is obtained from libraries prepared from 
mRNA isolated from various tissues expressing OPG. In 
5 humans, tissue sources for OPG include kidney, liver, 
placenta and heart. Genomic DNA encoding OPG is 
obtained from genomic libraries which are commercially 
available from a variety of species. Synthetic DNA is 
obtained by chemical synthesis of overlapping 

10 oligonucleotide fragments followed by assembly of the 
fragments to reconstitute part or all of the coding 
region and flanking sequences (see U.S. Patent No. 
4,695,623). RNA may be obtained in large quantities 
use of procaryotic expression vectors which direct 

15 high-level synthesis of mRNA, such as vectors using T7 
promoters and RNA polymerase. 

Nucleic acid sequences of the invention are 
useful for the expression of chimeric OPG polypeptides. 
Expression may be carried out in transfected host cells 

20 for production of recombinant protein in quantities 

sufficient for diagnostic or therapeutic applications. 
In addition, chimeric OPG polypeptides may be expressed 
in vivo and secreted into the circulation to provide 
therapeutic benefit. 

25 

Vectors and Host Cells 

Expression vectors containing nucleic acid 
sequences encoding OPG fusion proteins, host cells 
transformed with said vectors and methods for the 

30 production of OPG fusion proteins are also provided by 
the invention. An overview of expression of 
recombinant proteins is found in Methods of Enzvmoloav 
v. 185, Goeddel, D.V. ed. Academic Press (1990). 

Host cells for the production of OPG fusion 

3 5 proteins include procaryotic host cells, such as 

E. coli . yeast, plant, insect and mammalian host cells. 
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E. coli strains such as HB101 or JM101 are suitable for 
expression. Preferred mammalian host cells include 
COS, CHOd-, 293, CV-1, 3T3, baby hamster kidney (BHK) 
cells and others. Mammalian host cells are preferred 
5 when post- translational modifications, such as 

glycosylation and polypeptide processing, are important 
for OPG chimera activity. Mammalian expression allows 
for the production of secreted polypeptides which may 
be recovered from the growth medium. 

10 Vectors for the expression of OPG chimeric 

polypeptides contain at a minimum sequences required 
for vector propogation and for expression of the cloned 
insert. These sequences include a replication origin, 
selection marker, promoter, ribosome binding site, 

15 enhancer sequences, RNA splice sites and transcription 
termination site. Vectors suitable for expression in 
the aforementioned host cells are readily available and 
the nucleic acids of the invention are inserted into 
the vectors using standard recombinant DNA techniques. 

20 Vectors for tissue-specific expression of OPG chimeric 
polypeptides are also included. Such vectors include 
promoters which function specifically in liver, kidney 
or other organs for production in mice, and viral 
vectors for the expression of OPG in targeted human 

25 cells. 

Using an appropriate host -vector system, OPG 
chimeric polypeptides are produced recombinantly by 
culturing a host cell transformed with an expression 
vector containing nucleic acid sequences encoding an 

3 0 OPG chimeric polypeptide under conditions such that the 
polypeptide is produced, and isolating the product of 
expression. OPG chimeras are produced in the 
supernatant of transfected mammalian cells or in 
inclusion bodies of transformed bacterial host cells. 

3 5 OPG chimeras so produced may be purified by procedures 
known to one skilled in the art as described below. 



WO 98/49305 PCT/US98/08631 



28 

Expression vectors for mammalian hosts are exemplified 
by plasmids such as pDSRct described in PCT Application 
No. 90/14363; see also Methods in Enzymology vol. 185, 
D.V. Goeddel, ed. pp. 487-511 for additional examples. 
5 A variety of expression vectors are available for 

bacterial host cells and are described in Methods in 
Enzvmoloav, ibid. pp. 14-37 and references cited 
therein. It is anticipated that the specific plasmids 
and host cells described are for illustrative purposes 
10 and that the choice of any specific plasmid and host 

cell for expression of an OPG chimeric polypeptide will 
depend upon consideration of a variety of factors by 
one skilled in the art. 

15 Antibodies 

Also encompassed by the invention are 
antibodies specifically binding to an OPG chimeric 
polypeptide. Antigens for the generation of antibodies 
may be full-length polypeptides or peptides spanning a 

2 0 portion of the OPG sequence. Immunological procedures 

for the generation of polyclonal or monoclonal 
antibodies reactive with OPG are known to one skilled 
in the art (see, for example, Harlow and Lane, 
Antibodies: A Laboratory Manual Cold Spring Harbor 

25 Laboratory Press, Cold Spring Harbor N.Y. (1988)). 

Antibodies so produced are characterized for binding 
specificity and epitope recognition using standard 
enzyme-linked immunosorbent assays. Antibodies also 
include chimeric antibodies having variable and 

30 constant domain regions derived from different species. 
In one embodiment, the chimeric antibodies are 
humanized antibodies having murine variable domains and 
human constant domains. Also encompassed are 
complementary determining regions grafted to a human 

3 5 framework (so-called CDR-grafted antibodies) . Chimeric 
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and CDR-grafted antibodies are made by recombinant 
methods known to one skilled in the art. Also 
encompassed are human antibodies made in mice. 

Anti-OPG chimera antibodies of the invention 
5 may be used as an affinity reagent to purify OPG from 
biological samples. In one method, the antibody is 
immobilized on CNBr- activated Sepharose and a column of 
antibody-Sepharose conjugate is used to remove OPG from 
liquid samples. Antibodies are also used as diagnostic 
10 reagents to detect and quantitate OPG in biological 
samples by methods described below. 

Pharmaceutical compositions 

The invention also provides for 

15 pharmaceutical compositions comprising a 

therapeutically effective amount of an OPG chimeric 
polypeptide together with a pharmaceutical ly acceptable 
diluent, carrier, solubilizer, emulsifier, preservative 
and/or adjuvant. The term "therapeutically effective 

2 0 amount" refers to an amount which provides a 

therapeutic effect for a specified condition and route 
of administration. The composition may be in a liquid 
or lyophilized form and comprises a diluent (Tris, 
acetate or phosphate buffers) having various pH values 

2 5 and ionic strengths, solubilizer such as Tween or 

Polysorbate, carriers such as human serum albumin or 
gelatin, preservatives such as thimerosal or benzyl 
alcohol, and antioxidants such as ascrobic acid or 
sodium metabisulf ite . Also encompassed are 

3 0 compositions comprising OPG chimeric polypeptides 

modified with water soluble polymers to increase 
solubility or stability. Compositions may also 
comprise incorporation of OPG chimeric polypeptides 
into liposomes, microemulsions , micelles or vesicles 
3 5 for controlled delivery over an extended period of 
time. Selection of a particular composition will 
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depend upon a number of factors, including the 
condition being treated, the route of administration 
and the pharmacokinetic parameters desired. A more 
extensive survey of components suitable for 
5 pharmaceutical compositions is found in Remington ' s 
Pharmaceutical Sciences . 18th ed. A. R. Gennaro, ed. 
Mack, Easton, PA (1980) . 

Compositions of the invention may be 
administered by injection, either subcutaneous, 
10 intravenous or intramuscular, or by oral, nasal, 
pulmonary or rectal administration. The route of 
administration eventually chosen will depend upon a 
number of factors and may be ascertained by one skilled 
in the art . 

15 Pharmaceutical compositions of chimeric OPG 

polypeptides are useful for treatment of receptor- 
mediated disorders, for example disorders resulting 
from the function (or lack thereof) of protein tyrosine 
kinases, cytokine, seven transmembrane domain, and cell 

20 adhesion receptors. Disorders resulting from the 
function (or lack thereof) of the corresponding 
polypeptide ligands of the above referenced receptors 
may also be treated. In one embodiment, compositions 
comprising TNF/OPG chimeras are used to treat 

2 5 TNF-related disorders such as inflammation, autoimmune 

diseases and conditions marked by excessive apoptosis. 
Chimeras of the invention may act as agonists to 
stimulate receptor activation and associated changes in 
cell activity, or chimeras may be antagonists which 
30 block receptor function. 

The invention also provides for 
pharmaceutical compositions comprising a 
therapeutically effective amount of the nucleic acids 
of the invention together with a pharmaceutical ly 

3 5 acceptable adjuvant. Nucleic acid compositions will be 
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suitable for delivery to cells and tissues as part of 
an anti-sense or gene therapy regimen. 

The following examples are offered to more 
fully illustrate the invention, but are not construed 
5 as limiting the scope thereof. 

EXAMPLE 1 

Construction and Expression of TNFbp/OPG fusion 

proteins 

10 

The TNFbp/OPG [196-401] chimeric gene was 
prepared in a two step PCR process. A first round of 
PCR was designed to produce overlapping PCR products 
from each gene. The templates used were plasmids 
15 p2302, containing the gene encoding TNFbp 4.0 (Figure 
4) fused to the Fc region of human IgGl, and plasmid 
pRcCMV-human OPG (ATCC accession no. 69969), containing 
the gene for human OPG. The PCR products were gel 
purified and used as a template to create the chimeric 

2 0 gene. Primers used for the PCR reactions are as 

follows: 1275-51 (containing a 5' Xbal site, consensus 
Kozak and the start of the hTNFbp gene) and 13 68-82 
(containing a portion of OPG cDNA, an Agel site and the 
3' end of the human TNFbp 4.0 sequence) were used to 

25 amplify the TNFbp gene from p2302; 1368-83 (containing 
the 3' end of TNFbp, an Agel site and the 5' end of the 
hOPG C-terminal domain) and 1295-27 (containing a Sail 
site and the 3' end of the OPG cDNA) were used to 
amplify the OPG [196-401] gene from pRcCMV-human OPG. A 

30 second PCR reaction used primers 1275-51 and 1295-27 to 
generate the chimeric gene. 

The PCR product was cut with Xbal /Sal I and 
subcloned into the pDSRoc2 expression vector to give 
plasmid p389-l. The expression cassette contains a 

3 5 SV40 early promoter driving the expression of the 

chimeric gene and also includes an SV40 late intron, an 
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HTLV translation enhancing signal and an tx2-FSH 
polyadenylation signal (DeClerck, et aL J. Biol. Chem. 
266 , 3893-3899 (1991)). The pDSRa2 vector also 
contains a DHFR cassette for selection in CHO d- cells. 

5 

Primer Sequences: 

1275-51: 

(SEQ ID NO: ) 

10 5'-CGC TCTAGA CCACC ATG GGC CTC TCC ACC GTG-3 ' 

Xbal Kozak M G L S T V 

13 68-82: 

(SEQ ID NO: ) 

15 5 ' -ACACAGGGTAACATCTAT ACCGGT GGTGCCTGAGTCCTCAG-3 ' 

hOPG C- terminus Agel hTNFbp 

1368-83 : 

(SEQ ID NO: ) 

20 5 ' -CTGAGGACTCAGGCACC ACCGGT ATAGATGTTACCCTGTG-3 ' 

EDSGT TG IDVTL 

TNFbp Agel hOPG C-terminus 

1295-27: 

25 (SEQ ID NO: ) 

5'-CCTCT GTCGAC TA TTA TAA GCA GCTTATTTTCACGGATTG-3 ' 

Sail * * L C... OPG--> 

Other constructs with truncated OPG 
30 dimerization doamins were created as follows: 



The primer pair for OPG [194-401] was 1295-27 and 
1428-89. 
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1428-89 : 

(SEQ ID NO: ) 

TCA ACCGGT AAA TGT GGA ATA GAT GTT AC 
5 Agel K C G I D V T 

The primer pair for OPG [217-401] was 1295-27 and 
1388-50 • 

10 1388-50: 

(SEQ ID NO: ) 

GTTT ACCGGT CCT AAC TGG CTT AGT GTC 
Agel P N W L S V 

15 The primer pair for OPG [248-401] was 1295-27 and 
1388-51 . 

1388-51 : 

(SEQ ID NO: ) 

20 AGC ACCGGT GAA CAG ACT TTC CAG CTG 

Agel E Q T F Q L 

The primer pair for OPG [304-401] was 1295-27 and 
1388-52 . 

25 

1388-52 : 

(SEQ ID NO: ) 

GGAA ACCGGT CCG GGA AAG AAA GTG GG 
Agel P G K K V G 

30 

The corresponding TNFbp/OPG fusion was constructed by- 
excising the Agel /Sal I OPG fragment from p389-l and 
replacing it with Agel /Sail digested OPG PCR products 
from the above reactions. The amino acid sequences 
35 encoded by the above TNFbp/OPG contructs are shown in 
Figure 4. 
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Transient transf ections were performed in 
COS-7 cells by electroporation. Ten jig of plasmid DNA 
was electroporated into 2xl0 6 cells in 0.8 mis of DMEM. 
The electroporations were done in 0.4 cm cuvettes at 
5 1.6 kV, 25 mF and 200 ohms. The electroporated cells 
were plated in 10 -cm dishes in DMEM containing 10% FBS, 
lx glutamine/penicillin/streptomycin, lx non-essential 
amino acids, lx Na-pyruvate. The following day the 
media was changed to media containing only 1% FBS. 

10 After an additional 72 hours, the conditioned media was 
harvested and 17 \il was electrophoresed on a 12% 
denaturing, non-reducing gel. These gels were blotted 
and analyzed by western blots for the presence of 
monomer and covalently-linked dimers . The primary 

15 antibody was anti-TNFbp (R&D systems, AB-225-PB) at a 
1:1000 dilution and the secondary antibody was HRP , 
rabbit anti-goat (Pierce) at a 1:1000 dilution. 

Stable transf ections were done in CHO d- 
cells by calcium phosphate precipitation (DeClerck et 

20 al . , supra ) . The transf ection was performed as 

described except that 20 ^g of Pvul linearized plasmid 
was used with 10 jig of herring sperm carrier DNA and 10 
Hi of calcium phosphate maximizer (Clontech) to 
transfect to a 10-cm dish containing approximately 

25 5xl0 5 cells. After 2 weeks in HT- selection, colonies 
were ring-cloned and expanded into 24-well plates. 
Once confluent, two day serum-free conditioned media 
(SFCM) was prepared and analyzed for the expression of 
TNFbp/OPG fusion protein by western blot. High 

3 0 expressing clones were expanded and grown in roller 

bottles for 7d SFCM harvests. The results are shown in 
Figure 5. 
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EXAMPLE 2 

Biological Activity of TNFbp/OPG chimeric proteins 

WEHI Cytotoxicity Assay 
5 The WEHI assay is an in vitro cell 

proliferation assay (Edwards et al . Endocrinology 
128 , 989-996 (1991)). The cell lines are sensitive to 
TNF-ot (i.e., TNF-ot is cytotoxic). In the presence of a 
TNF-a inhibitor, the cells were protected from the 

10 cytotoxic effect and thus were able to proliferate. 

TNF-sensitive WEHI 164 clone 13 cells are 
suspended at a concentration of 20 x 10 4 cells/ml in 
RPMI (Gibco, Grand Island, NY) medium supplemented with 
5% Fetal Calf Serum (Hyclone) and penicillin 

15 50U/ml : streptomycin 50 mg/ml. One hundred microliters 
of this cell suspension are placed in each well of 
flat-bottomed 96-cell microtiter plates, and the cells 
are allowed to adhere for 4-6 hours at 37°C in 7% CO2 . 
Medium is then aspirated, and 0.60 mg/ml actinomycin-D 

20 (Sigma Chemical Co., St.- Louis, MO) is added to each 
well. A standard curve using serial dilutions at 
0, 0.001 0.01, 0.1, 1, 10, 100 U/ml recombinant human 
TNF is run with each assay. Serially diluted 10-fold 
concentrations of TNFbp/OPG chimeras from serum- free 

25 conditioned medium are further diluted in RPMI-1640 
medium containing 5% FBS and then added to duplicate 
wells (50 M-l/well) containing adherent WEHI 164 cells 
after the addition of recombinant mouse TNF-a. WEHI- 
164 clone 13 cells are incubated for 18 hours at 37°C 

30 in 5% CO2- Maximal killing is determined by adding 
0.02% Triton X-100 (TX-100) to test wells. After 
incubation, 7 0 jil medium are aspirated, and 50 |il of a 
1 mg/mL solution of the organic dye MTT tetrazolium 
(3- [4, 5-dimethylthiozol-2-yl]2, 5-diphenyl tetrazolium 

3 5 bromide; Sigma) is added, and cells are incubated for 
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an additional 4-6 hours. All supernatants are then 
removed, and 50 jil DMF/SDS solution (20% SDS, and 50% 
N,N dime thy lformamide, pH 4.7) is added to each well. 
The DMF/SDS solution is pipetted up and down several 
5 times until all MTT crystals are dissolved, and cells 
were incubated for an additional 2-22 hours. The 
absorbances (abs) are read on a Vmax reader at 570-650. 
The percent specific cytotoxicity is calculated from 
optical densities using the formula: % specific 
10 cytotoxicity = 100% X [abs (cells + medium) - abs (cells 
+ sample) ] /abs (cells + medium) - abs (cells + TX-100) ] . 
The number of units of TNF in each sample is determined 
using the percent specific cytotoxicities of the murine 
standards . 

15 

L929 Cytotoxicity Assay 

The L929 cytotoxicity assay is an in vitro 
cell proliferation assay (Parmely et al . J. Immunol. 
151, 389-396 (1993), the disclosure of which is hereby 

2 0 incorporated by reference) which also assesses the 
cytotoxicity of TNF-oc-sensitive killing. The cell 
lines are sensitive to TNF-ot (i.e.; TNF-a is 
cytotoxic) . In the presence of a TNF -a inhibitor, the 
cells are protected from the cytotoxic effect and thus 

2 5 survive and are able to proliferate. 

The L929 cell line was obtained from the 
American Type Culture Collection (catalog number ATCC 
CCL 1 NCTC clone 929), as described previously by 
Parmely et. al . (1993), supra . L929 cells were grown 

30 in tissue culture flasks in Dulbecco ' s MEM with 10 % 
fetal calf serum (FCS) to 80 % confluence. Cells were 
trypsinized and seeded at 8,000-10,000 cells/well in 
100 ml into Falcon #3072 96 well plates and incubated 
for 20 to 40 hours at 37 °C in 5% C0 2 . Samples of 

35 TNFbp/Fc or TNFbp/OPG [194-401] polypeptides were 
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serially diluted in medium and added in triplicate 
followed by addition of TNFct to reach a final 
concentration of 0:5 mg/ml. The cultures were incubated 
at 37 °C overnight and cell density was measured by 
crystal violet. Medium was removed by inverting the 9 6 
well plates. Cells were fixed in 100 ul 100% methanol 
for 2 minutes. After removal of methanol the plates 
were allowed to dry for 10 minutes. 100 \il of 0.10% 
crystal violet stain in 2 0% methanol was added and 
plates were Ststained for 10 minutes at room 
temperature. Excess stain was removed by inverting 
plates. Plates were washed by dipping three times in 
ice-cold distilled water and excess water was removed 
from the wells by gently blotting plates on a tissue. 
100 \il of 100% methanol was added to stained cells and 
optical density was measured at 595 nm. Media control 
reactions contained L929 cells and medium alone, and 
TNF control reactions contained L929 cells with 0.5 
ng/ml TNFct. 

The activity in this assay of TNFbp/OPG 
fusions constructed as described in Example 1 is shown 
in Figure 6 . 



★ ★ * 

25 

While the present invention has been 
described in terms of the preferred embodiments, it is 
understood that variations and modifications will occur 
to those skilled in the art. Therefore, it is intended 
3 0 that the appended claims cover all such equivalent 

variations which come within the scope of the invention 
as claimed. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Amgen Inc. 

<ii) TITLE OF INVENTION: CHIMERIC OPG POLYPEPTIDES 
(iii) NUMBER OF SEQUENCES : 87 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Amgen Inc. 
15 (B) STREET: 1840 Dehavilland Drive 

(C) CITY: Thousand Oaks 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 91320-1789 

20 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

25 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
30 (C) CLASSIFICATION: 

(Viii) ATTORNEY /AGENT INFORMATION: 
(A) NAME: Winter, Robert B. 

(C) REFERENCE /DOCKET NUMBER: A- 4 52 



(2) INFORMATION FOR SEQ ID NO : 1 : 



<i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: protein 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 

Asn Ser lie Cys 
1 

55 (2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

60 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 98/49305 PCT/US98/08631 



20 



35 



40 



45 



55 



60 



39 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

Asn Asn Ser lie Cys 
1 5 

10 (2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 

2 5 Gin Asn Asn Ser lie Cys 

1 5 

(2) INFORMATION FOR SEQ ID fsTO : 4 : 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 4 

Pro Gin Asn Asn Ser lie Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
5 0 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

His Pro Gin Asn Asn Ser lie Cys 
1 5 
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30 



40 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

lie His Pro Gin Asn Asn Ser lie Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Tyr lie His Pro Gin Asn Asn Ser lie Cys 
35 1 5 10 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: protein 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Lys Tyr lie His Pro Gin Asn Asn Ser lie Cys 
15 10 

55 (2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

60 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Gly Lys Tyr lie His Pro Gin Asn Asn Ser lie Cys 
15 10 

10 (2) INFORMATION FOR SEQ ID NO: 10: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

15 (C) STRAND EDNESS ; single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

2 5 Gin Gly Lys Tyr lie His Pro Gin Asn Asn Ser lie Cys 

15 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Gin Gly Lys Tyr lie His Pro Gin Asn Asn Ser lie Cys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Pro Gin Gly Lys Tyr He His Pro Gin Asn Asn Ser He Cys 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Val Cys Pro Gin Gly Lys Tyr He His Pro Gin Asn Asn Ser lie Cys 
15 10 15 



20 (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

2 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



3 5 Ser Val Cys Pro Gin Gly Lys Tyr He His Pro Gin Asn Asn Ser He 

15 10 15 



Cys 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asp Ser Val Cys Pro Gin Gly Lys Tyr He His Pro Gin Asn Asn Ser 
15 10 15 

He Cys 



(2) INFORMATION FOR SEQ ID NO: 16: 



WO 98/49305 PCT/US98/08631 



10 



45 



60 



43 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Phe Cys Cys Ser 
15 1 

(2) INFORMATION FOR SEQ ID NO : 17 : 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

Phe Cys Cys Ser Leu 
1 5 

35 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

50 Phe Cys Cys Ser Leu Cys 

1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

5 Phe Cys Cys Ser Leu Cys Leu 

1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

10 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20 

Ala Gin Met Cys 
1 

(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
3 0 (B) TYPE: amino acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

Thr Ala Gin Met Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : S ingle 
50 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 



Gin Thr Ala Gin Met Cys 
60 1 5 

(2) INFORMATION FOR SEQ ID NO: 23 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS; single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 



Asp Gin Thr Ala Gin Met Cys 
15 1 5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Tyr Asp Gin Thr Ala Gin Met Cys 
1 5 

3 5 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 



50 Tyr Tyr Asp Gin Thr Ala Gin Met Cys 

1 5 

(2) INFORMATION FOR SEQ ID NO: 26: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



WO 98/49305 PCT/US98/08631 



15 



20 



25 



35 



40 



45 



55 



46 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

5 Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 

15 10 

(2) INFORMATION FOR SEQ ID NO: 27: 

10 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
3 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 28; 

Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29 



Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 
60 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 30: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 



Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 
15 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 
15 10 15 

3 5 (2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 16 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

50 Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 

15 10 15 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met 
15 10 15 

Cys 

(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin 
1 5 10 * 15 

3 0 Met Cys 

(2) INFORMATION FOR SEQ ID NO: 35: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNES S : s ingle 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala 
15 10 15 

Gin Met Cys 



(2) INFORMATION FOR SEQ ID NO: 36: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

5 

Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr 
15 10 15 

Ala Gin Met Cys 
10 20 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

2 0 (ii) MOLECULE TYPE: protein 



25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin 
1 " 5 10 15 

3 0 Thr Ala Gin Met Cys 

20 

(2) INFORMATION FOR SEQ ID NO: 38: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Tyr Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp 
15 10 15 

Gin Thr Ala Gin Met Cys 

20 

(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

5 

Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr 
15 10 15 

Asp Gin Thr Ala Gin Met Cys 
10 20 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 



2 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Thr Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr 
1 * 5 10 15 

3 0 Tyr Asp Gin Thr Ala Gin Met Cys 

20 

(2) INFORMATION FOR SEQ ID NO: 41: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu 
15 10 15 

Tyr Tyr Asp Gin Thr Ala Gin Met Cys 

20 25 

(2) INFORMATION FOR SEQ ID NO: 42: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

5 

Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg 
15 10 15 

Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 
10 20 25 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 2 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

2 0 (ii) MOLECULE TYPE: protein 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu 
15 10 15 

3 0 Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 

20 25 

(2) INFORMATION FOR SEQ ID NO: 44: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cys Arg 
1 5 10 15 

Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 

20 25 

(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

5 

Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cys 
15 10 15 

Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 
10 20 25 

{2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

2 0 <ii) MOLECULE TYPE: protein 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser Thr 
15 10 15 

30 Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 47: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



40 



45 



50 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
15 10 15 

Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : s i ng 1 e 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 

Arg Leu Cys Ala 
1 

(2) INFORMATION FOR SEQ ID NO: 49: 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 



Arg Leu Cys Ala Pro 
25 1 5 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 5 (ii) MOLECULE TYPE: protein 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

Arg Leu Cys Ala Pro Leu 
1 5 

45 (2) INFORMATION FOR SEQ ID NO: 51; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

5 0 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 



60 Arg Leu Cys Ala Pro Leu Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 52: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

15 Arg Leu Cys Ala Pro Leu Arg Lys 

1 5 

(2) INFORMATION FOR SEQ ID NO: 53: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 

Arg Leu Cys Ala Pro Leu Arg Lys Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: cDNA 



WO 98/49305 



PCT/US98/08631 



55 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 15.. 32 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

CGCTCTAGAC CACC ATG GGC CTC TCC ACC GTG 

Met Gly Leu Ser Thr Val 
1 5 



32 



15 



20 



25 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 

Met Gly Leu Ser Thr Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



40 



45 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57; 
ACACAGGGTA ACATCTATAC CGGTGGTGCC TGAGTCCTCA G 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 3 . . 40 



41 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

CT GAG GAC TCA GGC ACC ACC GGT ATA GAT GTT ACC CTG TG 40 
Glu Asp Ser Gly Thr Thr Gly lie Asp Val Thr Leu 
5 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 59: 

10 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Glu Asp Ser Gly Thr Thr Gly lie Asp Val Thr Leu 
20 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 60: 

<i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH : 40 base pairs 

<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: cDNA 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

CCTCTGTCGA CTATTATAAG CAGCTTATTT TCACGGATTG 40 
(2) INFORMATION FOR SEQ ID NO: 61: 

40 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
45 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

50 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 10.. 29 

5 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

TCAACCGGT AAA TGT GGA ATA GAT GTT AC 29 

Lys Cys Gly lie Asp Val 
1 5 

60 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

10 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

Lys Cys Gly lie Asp Val 
1 5 

15 (2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

<D) TOPOLOGY : linear 



25 



30 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE; 

(A) NAME / KEY : CDS 

(B) LOCATION: 11.. 28 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



GTTTACCGGT CCT AAC TGG CTT AGT GTC 

Pro Asn Trp Leu Ser Val 
35 1 5 



(2) INFORMATION FOR SEQ ID NO: 64: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
{ D ) TOPOLOGY : 1 inear 

45 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 

Pro Asn Trp Leu Ser Val 
50 1 5 

(2) INFORMATION FOR SEQ ID NO: 65: 

<i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
< C ) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: cDNA 



A 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 10.. 27 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

AGCACCGGT GAA CAG ACT TTC CAG CTG 27 
Glu Gin Thr Phe Gin Leu 
10 1 5 

(2) INFORMATION FOR SEQ ID NO: 66: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 <ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Glu Gin Thr Phe Gin Leu 
25 1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

35 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME / KEY : CDS 
40 (B) LOCATION: 11.. 27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67; 

45 GGAAAC CGGT CCG GGA AAG AAA GTG GG 27 

Pro Gly Lys Lys Val 
1 5 

50 (2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 



60 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

Pro Gly Lys Lys Val 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 69: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



15 



30 



45 



55 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Asn Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe 
15 10 15 



Ala Val Pro Thr Lys lie lie Pro Asn Trp Leu Ser Val Leu Val Asp 
20 20 25 30 

Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie Lys 
35 40 45 

25 Arg Arg His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp 

50 55 60 



Lys His Gin Asn Arg* Asp Gin Glu Met Val Lys Lys lie lie Gin Asp 
65 70 75 80 

lie Asp Leu Cys Glu Ser Ser Val Gin Arg His lie Gly His Ala Asn 

85 90 95 



Leu Thr Thr Glu Gin Leu Arg lie Leu Met Glu Ser Leu Pro Gly Lys 
35 100 105 110 

Lys lie Ser Pro Asp Glu lie Glu Arg Thr Arg Lys Thr Cys Lys Pro 
115 120 125 

40 Ser Glu Gin Leu Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn Gly 

130 135 140 



Asp Gin Asp Thr Leu Lys Gly Leu Met Tyr Ala Leu Lys His Leu Lys 
145 150 155 160 

Ala Tyr His Phe Pro Lys Thr Val Thr His Ser Leu Arg Lys Thr lie 

165 170 175 



Arg Phe Leu His Ser Phe Thr Met Tyr Arg Leu Tyr Gin Lys Leu Phe 
50 180 185 190 



Leu Glu Met lie Gly Asn Gin Val Gin Ser Val Lys lie Ser Cys Leu 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 
6 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Lys Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe 
15 10 15 

Ala Val Pro Thr Lys lie lie Pro Asn Trp Leu Ser Val Leu Val Asp 

20 25 30 



Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie Lys 
15 35 40 45 

Arg Arg His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp 
50 55 60 

20 Lys His Gin Asn Arg Asp Gin Glu Met Val Lys Lys lie lie Gin Asp 

65 70 75 80 



lie Asp Leu Cys Glu Ser Ser Val Gin Arg His Leu Gly His Ser Asn 

85 90 95 

Leu Thr Thr Glu Gin Leu Leu Ala Leu Met Glu Ser Leu Pro Gly Lys 

100 105 110 



Lys lie Ser Pro Glu Glu lie Glu Arg Thr Arg Lys Thr Cys Lys Ser 
30 115 120 125 



Ser Glu Gin Leu Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn Gly 
130 135 140 

Asp Gin Asp Thr Leu Lys Gly Leu Met Tyr Ala Leu Lys His Leu Lys 
145 150 155 160 

Thr Ser His Phe Pro Lys Thr Val Thr His Ser Leu Arg Lys Thr Met 

165 170 175 

Arg Phe Leu His Ser Phe Thr Met Tyr Arg Leu Tyr Gin Lys Leu Phe 

180 185 190 



Leu Glu Met lie Gly Asn Gin Val Gin Ser Val Lys lie Ser Cys Leu 
45 195 200 205 

(2) INFORMATION FOR SEQ ID NO: 71: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 08 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Lys Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe 
15 10 15 

5 

Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val Asp 

20 25 30 

Asn Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie Lys 
10 35 40 45 

Arg Gin His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp 
50 55 60 

15 Lys His Gin Asn Lys Asp Gin Asp lie Val Lys Lys lie He Gin Asp 

65 70 75 80 



20 



35 



50 



60 



He Asp Leu Cys Glu Asn Ser Val Gin Arg His He Gly His Ala Asn 

85 90 95 

Leu Thr Phe Glu Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly Lys 

100 105 110 



Lys Val Gly Ala Glu Asp He Glu Lys Thr He Lys Ala Cys Lys Pro 
25 115 120 125 

Ser Asp Gin He Leu Lys Leu Leu Ser Leu Trp Arg He Lys Asn Gly 
130 135 140 

3 0 Asp Gin Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser Lys 

145 150 155 160 



Thr Tyr His Phe Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr He 

165 170 175 

Arg Phe Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe 

180 185 190 



Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys He Ser Cys Leu 
40 195 200 205 

(2) INFORMATION FOR SEQ ID NO: 72: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



<ix) FEATURE: 
55 (A) NAME / KEY : CDS 

(B) LOCATION: 1..483 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

GAT AGT GTG TGT CCC CAA GGA AAA TAT ATC CAC CCT CAA AAT AAT TCG 
Asp Ser Val Cys Pro Gin Gly Lys Tyr He His Pro Gin Asn Asn Ser 
1 5 10 15 
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ATT TGC TGT ACC AAG TGC CAC AAA GGA ACC TAC TTG TAC AAT GAC TGT 96 
lie Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys 

20 25 30 

5 

CCA GGC CCG GGG CAG GAT ACG GAC TGC AGG GAG TGT GAG AGC GGC TCC 144 
Pro Gly Pro Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser 
35 40 45 

10 TTC ACC GCT TCA GAA AAC CAC CTC AGA CAC TGC CTC AGC TGC TCC AAA 192 

Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys 
50 55 60 

TGC CGA AAG GAA ATG GGT CAG GTG GAG ATC TCT TCT TGC ACA GTG GAC 240 

15 Cys Arg Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val Asp 

65 70 75 80 

CGG GAC ACC GTG TGT GGC TGC AGG AAG AAC CAG TAC CGG CAT TAT TGG 288 

Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr Trp 
20 85 90 95 



25 



AGT GAA AAC CTT TTC CAG TGC TTC AAT TGC AGC CTC TGC CTC AAT GGG 33 6 

Ser Glu Asn Leu Phe Gin Cys Phe Asn Cys Ser Leu Cys Leu Asn Gly 

100 105 110 

ACC GTG CAC CTC TCC TGC CAG GAG AAA CAG AAC ACC GTG TGC ACC TGC 384 

Thr Val His Leu Ser Cys Gin Glu Lys Gin Asn Thr Val Cys Thr Cys 

115 120 125 

30 CAT GCA GGT TTC TTT CTA AGA GAA AAC GAG TGT GTC TCC TGT AGT AAC 432 

His Ala Gly Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys Ser Asn 

130 135 140 

TGT AAG AAA AGC CTG GAG TGC ACG AAG TTG TGC CTA CCC CAG ATT GAG 480 

35 Cys Lys Lys Ser Leu Glu Cys Thr Lys Leu Cys Leu Pro Gin lie Glu 
145 150 155 160 



40 



AAT 
Asn 



(2) INFORMATION FOR SEQ ID NO:73: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 <ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Asp Ser Val Cys Pro Gin Gly Lys Tyr lie His Pro Gin Asn Asn Ser 
55 1 5 10 15 

lie Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys 

20 25 30 

60 Pro Gly Pro Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser 

35 40 45 



483 
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Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys 
50 55 60 

Cys Arg Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val Asp 
5 65 70 75 80 

Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr Trp 

85 90 95 

10 Ser Glu Asn Leu Phe Gin Cys Phe Asn Cys Ser Leu Cys Leu Asn Gly 

100 105 110 



15 



35 



40 



50 



Thr Val His Leu Ser Cys Gin Glu Lys Gin Asn Thr Val Cys Thr Cys 
115 120 125 

His Ala Gly Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys Ser Asn 
130 135 140 



Cys Lys Lys Ser Leu Glu Cys Thr Lys Leu Cys Leu Pro Gin lie Glu 
20 145 150 155 160 

Asn 

25 (2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 

3 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1 . .70 5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



TTG CCC GCC CAG GTG GCA TTT ACA CCC TAC GCC CCG GAG CCC GGG AGC 
Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
45 1 5 10 15 



ACA TGC CGG CTC AGA GAA TAC TAT GAC CAG ACA GCT CAG ATG TGC TGC 
Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 

20 25 30 

AGC AAG TGC TCG CCG GGC CAA CAT GCA AAA GTC TTC TGT ACC AAG ACC 
Ser Lys Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr 
35 40 45 



55 TCG GAC ACC GTG TGT GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC 
Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 
50 55 60 

TGG AAC TGG GTT CCC GAG TGC TTG AGC TGT GGC TCC CGC TGT AGC TCT 
60 Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser 
65 70 75 80 
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GAC CAG GTG GAA ACT CAA GCC TGC ACT CGG GAA CAG AAC CGC ATC TGC 

Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg lie Cys 

85 90 95 

5 ACC TGC AGG CCC GGC TGG TAC TGC GCG CTG AGC AAG CAG GAG GGG TGC 

Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys 

100 105 110 

CGG CTG TGC GCG CCG CTG CGC AAG TGC CGC CCG GGC TTC GGC GTG GCC 

10 Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg Pro Gly Phe Gly Val Ala 

115 120 125 

AGA CCA GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC CCG 

Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 
15 130 135 140 



20 



40 



50 



55 



GGG ACG TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGC AGG CCC CAC 
Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp lie Cys Arg Pro His 
145 150 155 160 

CAG ATC TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC AGG GAT GCA 
Gin lie Cys Asn Val Val Ala lie Pro Gly Asn Ala Ser Arg Asp Ala 

165 170 175 



25 GTC TGC ACG TCC ACG TCC CCC ACC CGG AGT ATG GCC CCA GGG GCA GTA 
Val Cys Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val 

180 185 190 

CAC TTA CCC CAG CCA GTG TCC ACA CGA TCC CAA CAC ACG CAG CCA ACT 
3 0 His Leu Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr 

195 200 205 

CCA GAA CCC AGC ACT GCT CCA AGC ACC TCC TTC CTG CTC CCA ATG GGC 
Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly 
35 210 215 220 

CCC AGC CCC CCA GCT GAA GGG AGC ACT GGC GAC 
Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly Asp 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



60 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Ser 
15 10 15 

Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 

20 25 30 

Ser Lys Cys Ser Pro Gly Gin His Ala Lys Val Phe Cys Thr Lys Thr 
35 40 45 



Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 
50 55 60 
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25 



45 



55 



65 



Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser 
65 70 75 80 

Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg lie Cys 

85 90 95 

Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys 

100 105 110 

Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg Pro Gly Phe Gly Val Ala 
115 120 125 



Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 

15 130 135 140 

Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp lie Cys Arg Pro His 

145 150 155 160 

2 0 Gin lie Cys Asn Val Val Ala lie Pro Gly Asn Ala Ser Arg Asp Ala 

165 170 175 



Val Cys Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val 

180 185 190 

His Leu Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr 
195 200 205 



Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly 
30 210 215 220 

Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly Asp 
225 230 235 

3 5 (2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 420 amino acids 
<B) TYPE: amino acid 

4 0 <C) STRANDEDNESS: single 

<D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 



50 Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu 

15 10 15 



Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu Val Pro 

20 25 30 

His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Gly Lys 
35 40 45 



Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His Lys 
60 50 55 60 

Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 
65 70 75 80 
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25 



40 



55 



66 



Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 

85 90 95 

Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val 

100 105 110 

Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
115 120 125 

Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 
130 135 140 



Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 

15 145 150 155 160 

Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 

165 170 175 

20 Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 

180 185 190 



Lys Leu Cys Leu Pro Gin lie Glu Asn Val Lys Gly Thr Glu Asp Ser 
195 200 205 

Gly Thr Thr Gly Lys Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala 
210 215 220 



Phe Phe Arg Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser 
30 225 230 235 240 

Val Leu Val Asp Asn Leu Pro Gly Thr Lys val Asn Ala Glu Ser Val 

245 250 255 

35 Glu Arg lie Lys Arg Gin His Ser Ser Gin Glu Gin Thr Phe Gin Leu 

260 265 270 



Leu Lys Leu Trp Lys His Gin Asn Lys Asp Gin Asp lie Val Lys Lys 

275 280 285 

lie lie Gin Asp lie Asp Leu Cys Glu Asn Ser Val Gin Arg His He 

290 295 300 



Gly His Ala Asn Leu Thr Phe Glu Gin Leu Arg Ser Leu Met Glu Ser 
45 305 310 315 320 

Leu Pro Gly Lys Lys Val Gly Ala Glu Asp He Glu Lys Thr He Lys 

325 330 335 

50 Ala Cys Lys Pro Ser Asp Gin He Leu Lys Leu Leu Ser Leu Trp Arg 

340 345 350 



He Lys Asn Gly Asp Gin Asp Thr Leu Lys Gly Leu Met His Ala Leu 

355 360 365 

Lys His Ser Lys Thr Tyr His Phe Pro Lys Thr Val Thr Gin Ser Leu 

370 375 380 



Lys Lys Thr He Arg Phe Leu His Ser Phe Thr Met Tyr Lys Leu Tyr 
60 385 390 395 400 

Gin Lys Leu Phe Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys 

405 410 415 
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67 



lie Ser Cys Leu 

420 

5 (2) INFORMATION FOR SEQ ID NO; 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



25 



40 



55 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



20 Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu 

15 10 15 



Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu Val Pro 

20 25 30 

His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Gly Lys 

35 40 45 



Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His Lys 
30 50 55 60 

Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 
65 70 75 80 

35 Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 

85 90 95 



Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val 

100 105 110 

Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
115 120 125 



Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 
45 130 135 140 

Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 
145 150 155 160 

50 Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 

165 170 175 



Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 

180 185 190 

Lys Leu Cys Leu Pro Gin lie Glu Asn Val Lys Gly Thr Glu Asp Ser 
195 200 205 



Gly Thr Thr 
60 210 
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30 



45 
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68 



(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu 
15 10 15 



Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu Val Pro 
20 20 25 30 

His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Gly Lys 
35 40 45 

2 5 Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His Lys 

50 55 60 



Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 
65 70 75 80 

Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 

85 90 95 



Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val 
35 100 105 110 

Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
115 120 125 

40 Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 

130 135 140 



Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 
145 150 155 160 

Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 

165 170 175 



Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 
50 180 185 190 

Lys Leu Cys Leu Pro Gin lie Glu Asn Val Lys Gly Thr Glu Asp Ser 
195 200 205 

55 Gly Thr Thr Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 

210 215 220 



Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val 

225 230 235 240 

Asp Asn Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie 

245 250 255 
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Lys Arg Gin His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 

260 265 270 

Trp Lys His Gin Asn Lys Asp Gin Asp lie Val Lys Lys lie lie Gin 
5 275 280 285 

Asp lie Asp Leu Cys Glu Asn Ser Val Gin Arg His lie Gly His Ala 
290 295 300 

10 Asn Leu Thr Phe Glu Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly 

305 310 315 320 



15 



30 



35 



45 



60 



Lys Lys Val Gly Ala Glu Asp lie Glu Lys Thr lie Lys Ala Cys Lys 

325 330 335 

Pro Ser Asp Gin lie Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn 

340 345 350 



Gly Asp Gin Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser 
20 355 360 365 

Lys Thr Tyr His Phe Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr 
370 375 380 

2 5 lie Arg Phe Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu 

385 390 395 400 



Phe Leu Glu Met lie Gly Asn Gin Val Gin Ser Val Lys lie Ser Cys 

405 410 415 



Leu 



(2) INFORMATION FOR SEQ ID NO: 79: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu 
50 1 5 10 15 

Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu Val Pro 

20 25 30 

55 His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Gly Lys 

35 40 45 



Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His Lys 
50 55 60 

Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 
65 70 75 80 
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Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 

85 90 95 

Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val 
5 100 105 110 

Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
115 120 125 

10 Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 

130 135 140 



15 



30 



45 



Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 

145 150 155 160 

Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 

165 170 175 



Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 
20 180 185 190 

Lys Leu Cys Leu Pro Gin He Glu Asn Val Lys Gly Thr Glu Asp Ser 
195 200 205 

25 Gly Thr Thr Gly Pro Asn Trp Leu Ser Val Leu Val Asp Asn Leu Pro 

210 215 220 



Gly Thr Lys Va*l Asn Ala Glu Ser Val Glu Arg lie Lys Arg Gin His 

225 230 235 240 

Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin 

245 250 255 



Asn Lys Asp Gin Asp lie Val Lys Lys lie lie Gin Asp lie Asp Leu 
35 260 265 270 

Cys Glu Asn Ser Val Gin Arg His lie Gly His Ala Asn Leu Thr Phe 
275 280 285 

40 Glu Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val Gly 

290 295 300 



Ala Glu Asp lie Glu Lys Thr lie Lys Ala Cys Lys Pro Ser Asp Gin 
305 310 315 320 

lie Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn Gly Asp Gin Asp 

325 330 335 



Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr His 
50 340 345 350 

Phe Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr lie Arg Phe Leu 
355 360 365 

55 His Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Glu Met 

370 375 380 



60 



lie Gly Asn Gin Val Gin Ser Val Lys lie Ser Cys Leu 
385 390 395 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 366 amino acids 
5 (B) TYPE: amino acid 

{ C ) STRANDEDNES S ; S i ng 1 e 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 



15 



30 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu 
1 5 10 15 



Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu Val Pro 
20 20 25 30 

His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Gly Lys 
35 40 45 

25 Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His Lys 

50 55 60 



Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 
65 70 75 80 

Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 

85 90 95 



Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val 
35 100 105 110 

Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
115 120 125 

40 Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 

130 135 140 



Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 
145 150 155 160 

Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 

165 170 175 



Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 
50 180 185 190 

Lys Leu Cys Leu Pro Gin lie Glu Asn Val Lys Gly Thr Glu Asp Ser 
195 200 205 

55 Gly Thr Thr Gly Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His 

210 215 220 



Gin Asn Lys Asp Gin Asp lie Val Lys Lys lie lie Gin Asp lie Asp 

225 230 235 240 

Leu Cys Glu Asn Ser Val Gin Arg His lie Gly His Ala Asn Leu Thr 

245 250 255 
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Phe Glu Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val 

260 265 270 

Gly Ala Glu Asp lie Glu Lys Thr lie Lys Ala Cys Lys Pro Ser Asp 
5 275 280 285 

Gin lie Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn Gly Asp Gin 
290 295 300 

10 Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr 

305 310 315 320 



15 



45 



60 



His Phe Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr lie Arg Phe 

325 330 335 

Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Glu 

340 345 350 



Met lie Gly Asn Gin Val Gin Ser Val Lys lie Ser Cys Leu 
20 355 360 365 

(2) INFORMATION FOR SEQ ID NO: 81: 

( i ) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: protein 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu 
15 10 15 

40 Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu Val Pro 

20 25 30 



His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Gly Lys 

35 40 45 

Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His Lys 
50 55 60 



Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 
50 65 70 75 80 

Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 

85 90 95 

55 Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val 

100 105 110 



Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
115 120 125 

Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 
130 135 140 
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Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 
145 150 155 160 

Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 
5 165 170 175 

Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 

180 185 190 

10 Lys Leu Cys Leu Pro Gin lie Glu Asn Val Lys Gly Thr Glu Asp Ser 

195 200 205 



15 



30 



35 



45 



60 



Gly Thr Thr Gly Pro Gly Lys Lys Val Gly Ala Glu Asp lie Glu Lys 
210 215 220 

Thr lie Lys Ala Cys Lys Pro Ser Asp Gin lie Leu Lys Leu Leu Ser 
225 230 235 240 



Leu Trp Arg lie Lys Asn Gly Asp Gin Asp Thr Leu Lys Gly Leu Met 
20 245 250 255 

His Ala Leu Lys His Ser Lys Thr Tyr His Phe Pro Lys Thr Val Thr 

260 265 270 

2 5 Gin Ser Leu Lys Lys Thr lie Arg Phe Leu His Ser Phe Thr Met Tyr 

275 280 285 



Lys Leu Tyr Gin Lys Leu Phe Leu Glu Met lie Gly Asn Gin Val Gin 
290 295 300 

Ser Val Lys lie Ser Cys Leu 
305 310 

(2) INFORMATION FOR SEQ ID NO : 82 : 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



Met Asp Ser Val Cys Pro Gin Gly Lys Tyr lie His Pro Gin Asn Asn 
50 1 5 10 15 

Ser lie Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp 

20 25 30 

55 Cys Pro Gly Pro Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly 

35 40 45 



Ser Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser 
50 55 60 

Lys Cys Arg Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val 
65 70 75 80 
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Asp Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr 

85 90 95 

Trp Ser Glu Asn Leu Phe Gin Cys Phe Cys 
5 100 105 

(2) INFORMATION FOR SEQ ID NO: 83: 

<i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

15 (ii) MOLECULE TYPE: protein 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Met Asp Ser Val Cys Pro Gin Gly Lys Tyr lie His Pro Gin Asn Asn 
15 10 15 

25 Ser lie Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp 

20 25 30 



30 



50 



55 



60 



Cys Pro Gly Pro Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly 

35 40 45 

Ser Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser 
50 55 60 



Lys Cys Arg Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val 
35 65 70 75 80 

Asp Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr 

85 90 95 

40 Trp Ser Glu Asn Leu Phe Gin Cys Phe Asn Cys Ser Leu 

100 105 

(2) INFORMATION FOR SEQ ID NO: 84: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Met Asp Ser Val Cys Pro Gin Gly Lys Tyr lie His Pro Gin Asn Asn 
15 10 15 

Ser lie Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp 

20 25 30 



WO 98/49305 



PCTAJS98/08631 



75 

Cys Pro Gly Pro Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly 
35 40 45 

Ser Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser 
5 50 55 60 

Lys Cys Arg Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val 
65 70 75 80 

10 Asp Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr 

85 90 95 



15 



25 



30 



45 



Trp Ser Glu Asn Leu Phe Gin Cys Phe Asn Cys Ser Leu 

100 105 

(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 
20 <B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Met Tyr lie His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His 
15 10 15 



Lys Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr 

35 20 25 30 

Asp Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His 

35 40 45 

40 Leu Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin 

50 55 60 



Val Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys 
65 70 75 80 

Arg Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys 

85 90 95 



Phe Asn Cys Ser Leu 
50 100 

(2) INFORMATION FOR SEQ ID NO: 86; 

( i ) SEQUENCE CHARACTERISTICS : 

55 (A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Met Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys Pro 
5 1 5 10 15 

Gly Pro Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser Phe 

20 25 30 

10 Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys Cys 

35 40 45 



15 



45 



Arg Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val Asp Arg 
50 55 60 

Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr Trp Ser 
65 70 75 80 



Glu Asn Leu Phe Gin Cys Phe Asn Cys Ser Leu 
20 85 90 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 94 amino acids 

<B) TYPE: amino acid 
<C ) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: protein 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Met Ser lie Ser Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn 
1 5 10 15 

40 Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser 

20 25 30 



Gly Ser Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys 
35 40 45 

Ser Lys Cys Arg Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr 
50 55 60 



Val Asp Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His 
50 65 70 75 80 

Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe Asn Cys Ser Leu 

85 90 
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WHAT IS CLAIMED IS: 

1. A chimeric polypeptide comprising an amino acid 
5 sequence of an osteoprotegerin dimerization domain 

fused to a heterologous amino acid sequence. 

2 . The polypeptide of Claim 1 wherein the 
heterologous amino acid sequence and the 

10 osteoprotegerin dimerization domain are human. 

3 . The polypeptide of Claim 1 wherein the 
heterologous amino acid sequence and the 
osteoprotegerin dimerization domain are from different 

15 species . 

, 4 . The polypeptide of Claim 1 covalently 
associated with one or more chimeric polypeptides which 
result in a mulitmeric polypeptide complex. 

20 

5 . The polypeptide of Claim 4 wherein the complex 
is a dimer . 

6 . The polypeptide of Claim 1 wherein the 

25 heterologous amino acid sequence is a membrane -bound 
receptor lacking functional membrane associated amino 
acid sequences . 

7 . The polypeptide of Claim 6 wherein the receptor 
3 0 is selected from the group consisting of receptor 

tryrosine kinases, cytokine receptors, seven 
transmembrane domain receptors, and cell adhesion 
receptors . 
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8 . The polypeptide of Claim 1 wherein the 
heterologous amino' acid sequence is selected from 
members of the tumor necrosis factor-like receptor 

5 family consisting of TNFR-1, TNFR-2, TNFrp, NGFR, FasB, 
CD4 0, OX40, CD27, CD3 0, and 4-1BB. 

9 . The polypeptide of Claim 8 wherein the 
heterologous sequence comprises TNFR-1 lacking 

10 functional membrane-associated sequences. 

10. The polypeptide of Claim 9 wherein the 
heterologous sequence is a 30 kDa TNF inhibitor, a 40 
kDa TNF inhibitor, or an analog thereof. 

15 

11. The polypeptide of Claim 1 wherein the carboxy 
terminus of the heterologous sequence is fused to the 
amino terminus of the OPG dimerization domain. 

20 12. The polypeptide of Claim 1 wherein the amino 

terminus of the heterologous sequence is fused to the 
carboxy terminus of the OPG dimerization domain. 

13 . The polypeptide of Claim 1 wherein one or more 
25 amino acids are inserted between the heterologous 

sequence and the OPG dimerization domain. 

14. A multimeric polypeptide comprising covalently 
associated monomers of OPG chimeric polypeptides. 



15. The multimeric polypeptide of Claim 14 which 
is a dimer . 



16. An isolated nucleic acid sequence encoding the 
35 polypeptide of Claim 1. 
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17 . An expression vector comprising the nucleic 
acid sequence of Claim 16. 

5 18. A host cell transformed or transfected with 

the expression vector of Claim 17 in a manner allowing 
expression of the nucleic acid. 

19. A pharmaceutical composition comprising the 
10 polypeptide of any of Claims 1 to 15 . 
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FIGURE 1 
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Rat: lie Lys Asn Gly Asp Gin 

Mouse: lie Lys Asn Gly Asp Gin 

Human: lie Lys Asn Gly Asp Gin 



Rat: Ala Leu Lys His Leu Lys 

Mouse: Ala Leu Lys His Leu Lys 
Human: Ala Leu Lys His Ser Lys 
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(Con't) 

Asp Thr Leu Lys Gly Leu Met Tyr 
Asp Thr Leu Lys Gly Leu Met Tyr 
Asp Thr Leu Lys Gly Leu Met His 



Ala Tyr His Phe Pro Lys Thr Val 
Thr Ser His Phe Pro Lys Thr Val 
Thr Tyr His Phe Pro Lys Thr Val 



Rat: Thr His Ser Leu Arg Lys Thr lie Arg Phe Leu His Ser Phe 

Mouse: Thr His Ser Leu Arg Lys Thr Met Arg Phe Leu His Ser Phe 

Human: Thr Gin Ser Leu Lys Lys Thr lie Arg Phe Leu His Ser Phe 

Rat: Thr Met Tyr Arg Leu Tyr Gin Lys Leu Phe Leu Glu Met lie 

Mouse: Thr Met Tyr Arg Leu Tyr Gin Lys Leu Phe Leu Glu Met lie 

Human: Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Glu Met lie 

Rat: Gly Asn Gin Val Gin Ser Val Lys lie Ser Cvs Leu 

Mouse: Gly Asn Gin Val Gin Ser Val Lys lie Ser Cvs Leu 

Human: Gly Asn Gin Val Gin Ser Val Lys lie Ser Cvs Leu 
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FIGURE 2 
3 0kDa TNF Inhibitor 

5 ' -GATAGTGTGTGTCCCCAAGGAAAATATATCCACCCTCAAAATAATTCGATTTGCTGTACC- 
+ + + + + + 

DSVCPQGKYIHPQNNSICCT- 

-AAGTGCCACAAAGGAACCTACTTGTACAATGACTGTCCAGGCCCGGGGCAGGATACGGAC- 
+ + + + + + 

KCHKGTYLYNDCPGPGQDTD- 

-TGCAGGGAGTGTGAGAGCGGCTCCTTCACCGCTTCAGAAAACCACCTCAGACACTGCCTC- 
+ + + + + + 

CRECESGSFTASENHLRHCL- 
-AGCTGCTCCAAATGCCGAAAGGAAATGGGTCAGGTGGAGATCTCTTCTTGCACAGTGGAC- 

+ + + h + + 

SCSKCRKEMGQVEISSCTV D - 

-CGGGACACCGTGTGTGGCTGCAGGAAGAACCAGTACCGGCATTATTGGAGTGAAAACCTT- 
+ h + + H + 

RDTVCGCRKNQYRHYWSENL- 

- TTCCAGTGCTTCAATTGCAGCCTCTGCCTCAATGGGACCGTGCACCTCTCCTGCCAGGAG - 
+ h + + h h 

FQCFNCSLCLNGTVHLSCQE- 

-AAACAGAACACCGTGTGCACCTGCCATGCAGGTTTCTTTCTAAGAGAAAACGAGTGTGTC- 
+ + + + + + 

KQNTVCTCHAGFFLRENECV- 

- TCCTGTAGTAACTGTAAGAAAAGCCTGGAGTGCACGAAGTTGTGCCTACCCCAGATTGAG - 

+ H + + H H 

SCSNCKKSLECTKLCLPQIE- 

-AAT-3 ' 
+ 

N * 
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FIGURE 3 
4 0kDa TNF Inhibitor 

5 ' -TTGCCCGCCCAGGTGGCATTTACACCCTACGCCCCGGAGCCCGGGAGCACATGCCGGCTC 
+ + + + + + 

LPAQVAFTPYAPEPGSTCRL- 
-AGAGAATACTATGACCAGACAGCTCAGATGTGCTGCAGCAAGTGCTCGCCGGGCCAACAT 



+ + 



REYYDQTAQMCCSKC SPGQH- 

•GCAAAAGTCTTCTGTACCAAGACCTCGGACACCGTGTGTGACTCCTGTGAGGACAGCACA- 
+ + + + + + 

AKVF C TK T S DTVCD S CED S T- 

■TACACCCAGCTCTGGAACTGGGTTCCCGAGTGCTTGAGCTGTGGCTCCCGCTGTAGCTCT- 
+ + + + + . + 

YTQLWNWVPECLSCGSRC SS- 

■GACCAGGTGGAAACTCAAGCCTGCACTCGGGAACAGAACCGCATCTGCACCTGCAGGCCC- 
+ + + + + + 

DQVETQACTREQNR I CTC RP- 

■GGCTGGTACTGCGCGCTGAGCAAGCAGGAGGGGTGCCGGCTGTGCGCGCCGCTGCGCAAG' 
+ + + + + + 

GWYCALSKQEGCRLCAPLRK- 

■TGCCGCCCGGGCTTCGGCGTGGCCAGACCAGGAACTGAAACATCAGACGTGGTGTGCAAG- 
+ + + + + + 

CRPGFGVARPGTETSDVVCK- 

■CCCTGTGCCCCGGGGACGTTCTCCAACACGACTTCATCCACGGATATTTGCAGGCCCCAC- 
+ + + + + -- + 

PCAPGTF SNTTSSTDICRPH- 

■CAGATCTGTAACGTGGTGGCCATCCCTGGGAATGCAAGCAGGGATGCAGTCTGCACGTCC- 
+ + + + + + 

Q I CNVVAI PGNASRDAVCTS- 

•ACGTCCCCCACCCGGAGTATGGCCCCAGGGGCAGTACACTTACCCCAGCCAGTGTCCACA- 
+ + +' + + + 

T S PTR SMAPGAVHL PQPV ST- 

■CGATCCCAACACACGCAGCCAACTCCAGAACCCAGCACTGCTCCAAGCACCTCCTTCCTG 
+ + + + + + 

RSQHTQPTPEPSTAPSTSFL- 

■ CTCCC AATGGGCCCCAGCCCCCCAGCTGAAGGGAGC ACTGGCGAC - 3 ' 
+ + + + + + 

LPMGPSPPAEGSTGD* 
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FIGURE 4 



1 50 

TNFbp/OPG MGLSTVPDLL LPLVLLELLV GIYPSGVIGL VPHLGDREKR DSVCPQGKYI 

TNFbp 4.0 MGLSTVPDLL LPLVLLELLV GIYPSGVIGL VPHLGDREKR DSVCPQGKYI 

TNFbp/196 MGLSTVPDLL LPLVLLELLV GIYPSGVIGL VPHLGDREKR DSVCPQGKYI 

TNFbp/217 MGLSTVPDLL LPLVLLELLV GIYPSGVIGL VPHLGDREKR DSVCPQGKYI 

TNFbp/248 MGLSTVPDLL LPLVLLELLV GIYPSGVIGL VPHLGDREKR DSVCPQGKYI 

TNFbp/304 MGLSTVPDLL LPLVLLELLV GIYPSGVIGL VPHLGDREKR DSVCPQGKYI 

51 100 

TNFbp/OPG HPQNNSICCT KCHKGTYLYN DCPGPGQDTD CRECESGSFT AS ENHLRHC L 

TNFbp 4.0 HPQNNSICCT KCHKGTYLYN DCPGPGQDTD CRECESGSFT ASENHLRHCL 

TNFbp/196 HPQNNSICCT KCHKGTYLYN DCPGPGQDTD CRECESGSFT ASENHLRHCL 

TNFbp/217 HPQNNSICCT KCHKGTYLYN DCPGPGQDTD CRECESGSFT ASENHLRHCL 

TNFbp/248 HPQNNSICCT KCHKGTYLYN DCPGPGQDTD CRECESGSFT ASENHLRHCL 

TNFbp/304 HPQNNSICCT KCHKGTYLYN DCPGPGQDTD CRECESGSFT ASENHLRHCL 

101 150 

TNFbp/OPG SCSKCRKEMG QVEISSCTVD RDTVCGCRKN QYRHYWSENL FQCFNCSLCL 

TNFbp 4.0 SCSKCRKEMG QVEISSCTVD RDTVCGCRKN QYRHYWSENL FQCFNCSLCL 

TNFbp/196 SCSKCRKEMG QVEISSCTVD RDTVCGCRKN QYRHYWSENL FQCFNCSLCL 

TNFbp/217 SCSKCRKEMG QVEISSCTVD RDTVCGCRKN QYRHYWSENL FQCFNCSLCL 

TNFbp/248 SCSKCRKEMG QVEISSCTVD RDTVCGCRKN QYRHYWSENL FQCFNCSLCL 

TNFbp/304 SCSKCRKEMG QVEISSCTVD RDTVCGCRKN QYRHYWSENL FQCFNCSLCL 

151 200 

TNFbp/OPG NGTVHLSCQE KQNTVCTCHA GFFLRENECV SCSNCKKSLE CTKLCLPQIE 

TNFbp 4.0 NGTVHLSCQE KQNTVCTCHA GFFLRENECV SCSNCKKSLE CTKLCLPQIE 

TNFbp/196 NGTVHLSCQE KQNTVCTCHA GFFLRENECV SCSNCKKSLE CTKLCLPQIE 

TNFbp/217 NGTVHLSCQE KQNTVCTCHA GFFLRENECV SCSNCKKSLE CTKLCLPQIE 

TNFbp/248 NGTVHLSCQE KQNTVCTCHA GFFLRENECV SCSNCKKSLE CTKLCLPQIE 

TNFbp/304 NGTVHLSCQE KQNTVCTCHA GFFLRENECV SCSNCKKSLE CTKLCLPQIE 

201 250 
TNFbp/OPG NVKGTEDSGT TGKCGIDVTL CEEAFFRFAV PTKFTPNWLS VL VDNL PG \ "• " 

TNFbp 4.0 NVKGTEDSGT T 

TNFbp/196 NVKGTEDSGT T. . .GIDVTL CEEAFFRFAV PTKFTPNWLS VLVDNLPG". 

TNFbp/217 NVKGTEDSGT TG PNWLS VLVDNLPG'T 

TNFbp/248 NVKGTEDSGT TG 

TNFbp/304 NVKGTEDSGT TG 

196 (OPG) 217 (OPG) 

251 300 
TNFbp/OPG VNAESVERIK RQHSSQEQTF QLLKLWKHQN KDQDIVKKII QDIDLCEN?" 
TNFbp 4.0 

TNFbp/196 VNAESVERIK RQHSSQEQTF QLLKLWKHQN KDQDIVKKII QDIDLCENS" 
TNFbp/217 VNAESVERIK RQHSSQEQTF QLLKLWKHQN KDQDIVKKII QDIDLCEM:T * 

TNFbp/248 EQTF QLLKLWKHQN KDQDIVKKII QDIDLCENcT 

TNFbp/304 

248 (OPG) 
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FIGURE 5 
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FIGURE 6 



Inhibition of TNF Cytotoxicity of L929 
Murine Connective Tissue Cells 
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